SEQUENCE LISTING 



(1) GENERAL INFORMATION 

(i) APPLICANT: Yelton, Dale E . 

Rosok, Mae Joanne 

(ii) TITLE OF THE INVENTION: A METHOD FOR INHIBITING IMMUNOGLOBULIN- 
INDUCED TOXICITY RESULTING FROM THE USE OF IMMUNOGLOBULINS IN 
THERAPY AND IN VIVO DIAGNOSIS 

(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Merchant, Gould, Smith, Edell, Welter & Schmidt 

(B) STREET: 11150 Santa Monica Boulevard, Suite 400 

(C) CITY: Los Angeles 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP : 90025 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/905,293 

(B) FILING DATE: 01-AUG-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 60/023,033 

(B) FILING DATE: 02-AUG-1996 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Canady, Karen S 

(B) REGISTRATION NUMBER: 39', 927 

(C) REFERENCE/DOCKET NUMBER: 30436. 43USU1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 310-445-1140 

(B) TELEFAX : 310-445-9031 

(C) TELEX: 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TGGCACCGAA AGCTTTCTGG GGCAGGCCAG GCCTGA 3 6 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
TCCGGACATG TTGGTACCCA CGTGGTGGTC GACGCTGAGC CTGGCTTCGA GCAGACA 57 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 base pairs ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GTCGACCACC ACGTGGGTAC CAACATGTCC GGAGCCACAT GGACAGAGGC CGGCT 55 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
CTGGTTCTTG GTCATCTCCT CTCTAGATGG 3 0 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



ACCATGGTCG ACCTCAGACC TGCCAAGAGC CATATC 



36 



(2) INFORMATION FOR SEQ ID NO: 6: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CATGGTCACG TGGTGTGTCC CTGGATGCAG GCTACTCTAG 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CAGGGAGGGA GGGTGTCTGC TGGAAGCCAG GCTCAGCGCT GACCTCAGA 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GGAAAGAACC ATCACAGTCT CGCAGGGGCC CAGGGCAGCG CTGGGTGCTT 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GACGGATCGG GAGATCTGCT AGGTGACCTG AGGCGCGCCG GCTTCGAATA GCCAGAGTAA 
CCTTTTTTTT TAATTTTATT TTATTTTATT TTTGAGATGG AGTTTGGCGC CGATCTCCCG 
ATCCCCTATG GTCGACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA AGCCAGTATC 
TGCTCCCTGC TTGTGTGTTG GAGGTCGCTG AGTAGTGCGC GAGCAAAATT TAAGCTACAA 
CAAGGCAAGG CTTGACCGAC AATTGCATGA AGAATCTGCT TAGGGTTAGG CGTTTTGCGC 



TGCTTCGCGA TGTACGGGCC AGATATACGC GTTGACATTG ATTATTGACT AGTTATTAAT 360 

AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC GTTACATAAC 420 

TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA 4 80 

TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGACT 54 0 

ATTTACGGTA AACTGCCCAC TTGGCAGTAC * ATCAAGTGTA TCATATGCCA AGTACGCCCC 600 

CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC ATGACCTTAT 660 

GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC 720 

GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC 780 

TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA 840 

AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA CGGTGGGAGG 900 

TCTATATAAG CAGAGCTCTC TGGCTAACTA GAGAACCCAC TGCTTACTGG CTTATCGAAA 960 

TTAATACGAC TCACTATAGG GAGACCCAAG CTTGGTACCA ATTTAAATTG ATATCTCCTT 1020 

AGGTCTCGAG TCTCTAGATA ACCGGTCAAT CGATTGGAAT TCTTGCGGCC GCTTGCTAGC 10 80 

CACCATGGAG TTGTGGTTAA GCTTGGTCCT TCCTTGTCCT TGTTTTAAAA GGTGTCCAGT 114 0 

GTGAAGTGAA TCTGGTGGAG TCTGGGGGAG GCTTAGTGCA GCCTGGAGGG TCCCTGAAAG 1200 

TCTCCTGTGT AACCTCTGGA TTCACTTTCA GTGACTATTA CATGTATTGG GTTCGCCAGA 1260 

CTCCAGAGAA GAGGCTGGAG TGGGTCGCAT ACATTAGTCA AGGTGGTGAT ATAACCGACT 1320 

ATCCAGACAC TGTAAAGGGT CGATTCACCA TCTCCAGAGA CAATGCCAAG AACACCCTGT 13 80 

ACCTGCAAAT GAGCCGTCTG AAGTCTGAGG ACACAGCCAT GTATTACTGT GCAAGAGGCC 144 0 

TGGACGACGG GGCCTGGTTT GCTTACTGGG GCCAAGGGAC TCTGGTCACG GTCTCTGTAG 1500 

CTAGCACCAA GGGCCCATCG GTCTTCCCCC TGGCACCCTC CTCCAAGAGC ACCTCTGGGG 1560 

GCACAGCGGC CCTGGGCTGC CTGGTCAAGG ACTACTTCCC CGAACCGGTG ACGGTGTCGT 1620 

GGAACTCAGG CGCCCTGACC AGCGGCGTGC ACACCTTCCC GGCTGTCCTA CAGTCCTCAG 1680 

GACTCTACTC CCTCAGCAGC GTGGTCACCG TGCCCTCCAG CAGCTTGGGC ACCCAGACCT 174 0 

ACATCTGCAA CGTGAATCAC AAGCCCAGCA ACACCAAGGT GGACAAGAAA GTTGGTGAGA 1800 

GGCCAGCACA GGGAGGGAGG GTGTCTGCTG GAAGCCAGGC TCAGCGCTCC TGCCTGGACG 1860 

CATCCCGGCT ATGCAGCCCC AGTCCAGGGC AGCAAGGCAG GCCCCGTCTG CCTCTTCACC 1920 

CGGAGGCCTC TGCCCGCCCC ACTCATGCTC AGGGAGAGGG TCTTCTGGCT TTTTCCCCAG 1980 

GCTCTGGGCA GGCACAGGCT AGGTGCCCCT AACCCAGGCC CTGCACACAA AGGGGCAGGT 2 04 0 

GCTGGGCTCA GACCTGCCAA GAGCCATATC CGGGAGGACC CTGCCCCTGA CCTAAGCCCA 2100 

CCCCAAAGGC CAAACTCTCC ACTCCCTCAG CTCGGACACC TTCTCTCCTC CCAGATTCCA 2160 

GTAACTCCCA ATCTTCTCTC TGCAGAGCCC AAATCTTGTG ACAAAACTCA CACATGCCCA 2220 

CCGTGCCCAG GTAAGCCAGC CCAGGCCTCG CCCTCCAGCT CAAGGCGGGA CAGGTGCCCT 22 80 

AGAGTAGCCT GCATCCAGGG ACAGGCCCCA GCCGGGTGCT GACACGTCCA CCTCCATCTC 234 0 

TTCCTCAGCA CCTGAACTCC TGGGGGGACC GTCAGTCTTC CTCTTCCCCC CAAAACCCAA 24 00 

GGACACCCTC ATGATCTCCC GGACCCCTGA GGTCACATGC GTGGTGGTGG ACGTGAGCCA 24 60 

CGAAGACCCT GAGGTCAAGT TCAACTGGTA CGTGGACGGC GTGGAGGTGC ATAATGCCAA 2520 

GACAAAGCCG CGGGAGGAGC AGTACAACAG CACGTACCGT GTGGTCAGCG TCCTCACCGT 2580 

CCTGCACCAG GACTGGCTGA ATGGCAAGGA GTACAAGTGC AAGGTCTCCA ACAAAGCCCT 2640 

CCCAGCCCCC ATCGAGAAAA CCATCTCCAA AGCCAAAGGT GGGACCCGTG GGGTGCGAGG 2700 

GCCACATGGA CAGAGGCCGG CTCGGCCCAC CCTCTGCCCT GAGAGTGACC GCTGTACCAA 2760 

CCTCTGTCCC TACAGGGCAG CCCCGAGAAC CACAGGTGTA CACCCTGCCC CCATCCCGGG 2820 

ATGAGCTGAC CAAGAACCAG GTCAGCCTGA CCTGCCTGGT CAAAGGCTTC TATCCCAGCG 28 80 

ACATCGCCGT GGAGTGGGAG AGCAATGGGC AGCCGGAGAA CAACTACAAG ACCACGCCTC 2940 

CCGTGCTGGA CTCCGACGGC TCCTTCTTCC TCTACAGCAA GCTCACCGTG GACAAGAGCA 3 000 

GGTGGCAGCA GGGGAACGTC TTCTCATGCT CCGTGATGCA TGAGGCTCTG CACAACCACT 3 060 

ACACGCAGAA GAGCCTCTCC CTGTCTCCGG GTAAATGAGT GCGACGGCCG GCAAGCCCCC 3120 

GCTCCCCGGG CTCTCGCGGT CGCACGAGGA TGCTTGGCAC GTACCCCCTG TACATACTTC 3180 

CCGGGCGCCC AGCATGGAAA TAAAGCACCC AGCGCTGCCC TGGGCCCCTG CGAGACTGTG 324 0 

ATGGTTCTTT CCACGGGTCA GGCCGAGTCT GAGGCCTGAG TGGCATGAGG GAGGCAGAGC 33 00 

GGGTCCCACT GTCCCCACAC TGGCCCAGGC TGTGCAGGTG TGCCTGGGCC CCCTAGGGTG 3360 

GGGCTCAGCC AGGGGCTGCC CTCGGCAGGG TGGGGGATTT GCCAGCGTGG CCCTCCCTCC 3420 

AGCAGCACCT GCCCTGGGCT GGGCCACGGG AAGCCCTAGG AGCCCCTGGG GACAGACACA 34 80 

CAGCCCCTGC CTCTGTAGGA GACTGTCCTG TTCTGTGAGC GCCCCTGTCC TCCCGACCTC 354 0 

CATGCCCACT CGGGGGCATG CCTAGTCCAT GTGCGTAGGG ACAGGCCCTC CCTCACCCAT 3600 

CTACCCCCAC GGCACTAACC CCTGGCTGCC CTGCCCAGCC TCGCACCCGC ATGGGGACAC 3660 

AACCGACTCC GGGGACATGC ACTCTCGGGC CCTGTGGAGG GACTGGTGCA GATGCCCACA 3720 

CACACACTCA GCCCAGACCC GTTCAACAAA CCCCGCACTG AGGTTGGCCG GCCACACGGC 3780 

CACCACACAC ACACGTGCAC GCCTCACACA CGGAGCCTCA CCCGGGCGAA CTGCACAGCA 3 840 

CCCAGACCAG AGCAAGGTCC TCGCACACGT GAACACTCCT CGGACACAGG CCCCCACGAG 3 900 



CCCCACGCGG CACCTCAAGG CCCACGAGCC TCTCGGCAGC TTCTCCACAT GCTGACCTGC 3 960 

TCAGACAAAC CCAGCCCTCC TCTCACAAGG GTGCCCCTGC AGCCGCCACA CACACACAGG 4 020 

GGATCACACA CCACGTCACG TCCCTGGCCC TGGCCCACTT CCCAGTGCCG CCCTTCCCTG 4 080 

CAGGACGGAT CAGCCTCGAC TGTGCCTTCT AGTTGCCAGC CATCTGTTGT TTGCCCCTCC 414 0 

CCCGTGCCTT CCTTGACCCT GGAAGGTGCC ACTCCCACTG TCCTTTCCTA ATAAAATGAG 4200 

GAAATTGCAT CGCATTGTCT GAGTAGGTGT CATTCTATTC TGGGGGGTGG GGTGGGGCAG 4260 

GACAGCAAGG GGGAGGATTG GGAAGACAAT AGCAGGCATG CTGGGGATGC GGTGGGCTCT 4320 

ATGGCTTCTG AGGCGGAAAG AACCAGCTGG GGCTCTAGGG GGTATCCCCA CGCGCCCTGT 43 80 

AGCGGCGCAT TAAGCGCGGC GGGTGTGGTG GTTACGCGCA GCGTGACCGC TACACTTGCC 4440 

AGCGCCCTAG CGCCCGCTCC TTTCGCTTTC TTCCCTTCCT TTCTCGCCAC GTTCGCCGGG 4 500 

CCTCTCAAAA AAGGGAAAAA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC 4 560 

CTAACTCCGC CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC 4620 

TGACTAATTT TTTTTATTTA TGCAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG 4680 

AAGTAGTGAG GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGG ACAGCTCAGG 4 74 0 

GCTGCGATTT CGCGCCAAAC TTGACGGCAA TCCTAGCGTG AAGGCTGGTA GGATTTTATC 4 800 

CCCGCTGCCA TCATGGTTCG ACCATTGAAC TGCATCGTCG CCGTGTCCCA AAATATGGGG 4 860 

ATTGGCAAGA ACGGAGACCT ACCCTGGCCT CCGCTCAGGA ACGAGTTCAA GTACTTCCAA 4 92 0 

AGAATGACCA CAACCTCTTC AGTGGAAGGT AAACAGAATC TGGTGATTAT GGGTAGGAAA 4 980 

ACCTGGTTCT CCATTCCTGA GAAGAATCGA CCTTTAAAGG ACAGAATTAA TATAGTTCTC 504 0 

AGTAGAGAAC TCAAAGAACC ACCACGAGGA GCTCATTTTC TTGCCAAAAG TTTGGATGAT 5100 

GCCTTAAGAC TTATTGAACA ACCGGAATTG GCAAGTAAAG TAGACATGGT TTGGATAGTC 5160 

GGAGGCAGTT CTGTTTACCA GGAAGCCATG AATCAACCAG GCCACCTTAG ACTCTTTGTG 522 0 

ACAAGGATCA TGCAGGAATT TGAAAGTGAC ACGTTTTTCC CAGAAATTGA TTTGGGGAAA 5280 

TATAAACTTC TCCCAGAATA CCCAGGCGTC CTCTCTGAGG TCCAGGAGGA AAAAGGCATC 5340 

AAGTATAAGT TTGAAGTCTA CGAGAAGAAA GACTAACAGG AAGATGCTTT CAAGTTCTCT 540 0 

GCTCCCCTCC TAAAGCTATG CATTTTTATA AGACCATGGG ACTTTTGCTG GCTTTAGATC 5460 

TCTTTGTGAA GGAACCTTAC TTCTGTGGTG TGACATAATT GGACAAACTA CCTACAGAGA 5520 

TTTAAAGCTC TAAGGTAAAT ATAAAATTTT TAAGTGTATA ATGTGTTAAA CTACTGATTC 5580 

TAATTGTTTG TGTATTTTAG ATTCCAACCT ATGGAACTGA TGAATGGGAG CAGTGGTGGA 564 0 

ATGCCTTTAA TGAGGAAAAC CTGTTTTGCT CAGAAGAAAT GCCATCTAGT GATGATGAGG 5700 

CTACTGCTGA CTCTCAACAT TCTACTCCTC CAAAAAAGAA GAGAAAGGTA GAAGACCCCA 5760 

AGGACTTTCC TTCAGAATTG CTAAGTTTTT TGAGTCATGC TGTGTTTAGT AATAGAACTC 582 0 

TTGCTTGCTT TGCTATTTAC ACCACAAAGG AAAAAGCTGC ACTGCTATAC AAGAAAATTA 5880 

TGGAAAAATA TTCTGTAACC TTTATAAGTA GGCATAACAG TTATAATCAT AACATACTGT 594 0 

TTTTTCTTAC TCCACACAGG CATAGAGTGT CTGCTATTAA TAACTATGCT CAAAAATTGT 600 0 

GTACCTTTAG CTTTTTAATT TGTAAAGGGG TTAATAAGGA ATATTTGATG TATAGTGCCT 6 060 

TGACTAGAGA TCATAATCAG CCATACCACA TTTGTAGAGG TTTTACTTGC TTTAAAAAAC 6120 

CTCCCACACC TCCCCCTGAA CCTGAAACAT AAAATGAATG CAATTGTTGT TGTTAACTTG 6180 

TTTATTGCAG CTTATAATGG TTACAAATAA AGCAATAGCA TCACAAATTT CACAAATAAA 624 0 

GCATTTTTTT CACTGCATTC TAGTTGTGGT TTGTCCAAAC TCATCAATGT ATCTTATCAT 63 00 

GTCTGGATCG GCTGGATGAT CCTCCAGCGC GGGGATCTCA TGCTGGAGTT CTTCGCCCAC 6360 

CCCAACTTGT TTATTGCAGC TTATAATGGT TACAAATAAA GCAATAGCAT CACAAATTTC 642 0 

ACAAATAAAG CATTTTTTTC ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA 64 80 

TCTTATCATG TCTGTATACC GTCGACCTCT AGCTAGAGCT TGGCGTAATC ATGGTCATAG 654 0 

CTGTTTCCTG TGTGAAATTG TTATCCGCTC ACAATTCCAC ACAACATACG AGCCGGAAGC 6600 

ATAAAGTGTA AAGCCTGGGG TGCCTAATGA GTGAGCTAAC TCACATTAAT TGCGTTGCGC 6660 

TCACTGCCCG CTTTCCAGTC GGGAAACCTG TCGTGCCAGC TGCATTAATG AATCGGCCAA 6 720 

CGCGCGGGGA GAGGCGGTTT GCGTATTGGG CGCTCTTCCG CTTCCTCGCT CACTGACTCG 6 780 

CTGCGCTCGG TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG 684 0 

TTATCCACAG AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG 6900 

GCCAGGAACC GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC 6960 

GAGCATCACA AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA 702 0 

TACCAGGCGT TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT 7080 

ACCGGATACC TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA ATGCTCACGC 714 0 

TGTAGGTATC TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC 7200 

CCCGTTCAGC CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA 7260 

AGACACGACT TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT 7320 

GTAGGCGGTG CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA 73 80 

GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT 744 0 

TGATCCGGCA AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT 7500 



ACGCGCAGAA AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT 7560 

CAGTGGAACG AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC 762 0 

ACCTAGATCC TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA 7680 

ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA 774 0 

TTTCGTTCAT CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT ACGGGAGGGC 7800 

TTACCATCTG GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT 7860 

TTATCAGCAA TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA 792 0 

TCCGCCTCCA TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT 7980 

AATAGTTTGC GCAACGTTGT TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT 8040 

GGTATGGCTT CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG 8100 

TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC 8160 

GCAGTGTTAT CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC 8220 

GTAAGATGCT TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG 8280 

CGGCGACCGA GTTGCTCTTG CCCGGCGTCA ATACGGGATA ATACCGCGCC ACATAGCAGA 834 0 

ACTTTAAAAG TGCTCATCAT TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA 8400 

CCGCTGTTGA GATCCAGTTC GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT 8460 

TTTACTTTCA CCAGCGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG' 852 0 

GGAATAAGGG CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA 8580 

AGCATTTATC AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT 864 0 

AAACAAATAG GGGTTCCGCG CACATTTCCC CGAAAAGTGC CACCTGACGT C 8691 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GACGGATCGG GAGATCTGCT AGGTGACCTG AGGCGCGCCG GCTTCGAATA GCCAGAGTAA 60 

CCTTTTTTTT TAATTTTATT TTATTTTATT TTTGAGATGG AGTTTGGCGC CGATCTCCCG 120 

ATCCCCTATG GTCGACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA AGCCAGTATC 180 

TGCTCCCTGC TTGTGTGTTG GAGGTCGCTG AGTAGTGCGC GAGCAAAATT TAAGCTACAA 24 0 

CAAGGCAAGG CTTGACCGAC AATTGCATGA AGAATCTGCT TAGGGTTAGG CGTTTTGCGC 3 00 

TGCTTCGCGA TGTACGGGCC AGATATACGC GTTGACATTG ATTATTGACT AGTTATTAAT 360 

AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC GTTACATAAC 420 

TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG ACGTCAATAA 480 

TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA TGGGTGGACT 54 0 

ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA AGTACGCCCC 600 

CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC ATGACCTTAT 660 

GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC ATGGTGATGC 720 

GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA TTTCCAAGTC 780 

TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG GACTTTCCAA 84 0 

AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA CGGTGGGAGG 900 

TCTATATAAG CAGAGCTCTC TGGCTAACTA GAGAACCCAC TGCTTACTGG CTTATCGAAA 960 

TTAATACGAC TCACTATAGG GAGACCCAAG CTTGGTACCA ATTTAAATTG ATATCTCCTT 102 0 

AGGTCTCGAG TCTCTAGATA ACCGGTCAAT CGATTGGAAT TCTTGCGGCC GCTTGCTAGC 108 0 

CACCATGGAG TTGTGGTTAA GCTTGGTCCT TCCTTGTCCT TGTTTTAAAA GGTGTCCAGT 114 0 

GTGAAGTGAA TCTGGTGGAG TCTGGGGGAG GCTTAGTGCA GCCTGGAGGG TCCCTGAAAG 1200 

TCTCCTGTGT AACCTCTGGA TTCACTTTCA GTGACTATTA CATGTATTGG GTTCGCCAGA 1260 

CTCCAGAGAA GAGGCTGGAG TGGGTCGCAT ACATTAGTCA AGGTGGTGAT ATAACCGACT 132 0 

ATCCAGACAC TGTAAAGGGT CGATTCACCA TCTCCAGAGA CAATGCCAAG AACACCCTGT 1380 

ACCTGCAAAT GAGCCGTCTG AAGTCTGAGG ACACAGCCAT GTATTACTGT GCAAGAGGCC 144 0 

TGGACGACGG GGCCTGGTTT GCTTACTGGG GCCAAGGGAC TCTGGTCACG GTCTCTGTAG 1500 

CTAGCACCAA GGGCCCATCG GTCTTCCCCC TGGCACCCTC CTCCAAGAGC ACCTCTGGGG 1560 

GCACAGCGGC CCTGGGCTGC CTGGTCAAGG ACTACTTCCC CGAACCGGTG ACGGTGTCGT 1620 



GGAACTCAGG CGCCCTGACC AGCGGCGTGC ACACCTTCCC GGCTGTCCTA CAGTCCTCAG 1680 

GACTCTACTC CCTCAGCAGC GTGGTCACCG TGCCCTCCAG CAGCTTGGGC ACCCAGACCT 1740 

ACATCTGCAA CGTGAATCAC AAGCCCAGCA ACACCAAGGT GGACAAGAAA GTTGGTGAGA 1800 

GGCCAGCACA GGGAGGGAGG GTGTCTGCTG GAAGCCAGGC TCAGCGCTCC TGCCTGGACG 1860 

CATCCCGGCT ATGCAGCCCC AGTCCAGGGC AGCAAGGCAG GCCCCGTCTG CCTCTTCACC 1920 

CGGAGGCCTC TGCCCGCCCC ACTCATGCTC AGGGAGAGGG TCTTCTGGCT TTTTCCCCAG 1980 

GCTCTGGGCA GGCACAGGCT AGGTGCCCCT AACCCAGGCC CTGCACACAA AGGGGCAGGT 204 0 

GCTGGGCTCA GACCTGCCAA GAGCCATATC CGGGAGGACC CTGCCCCTGA CCTAAGCCCA 2100 

CCCCAAAGGC CAAACTCTCC ACTCCCTCAG CTCGGACACC TTCTCTCCTC CCAGATTCCA 2160 

GTAACTCCCA ATCTTCTCTC TGCAGAGCCC AAATCTTGTG ACAAAACTCA CACATGCCCA 2220 

CCGTGCCCAG GTAAGCCAGC CCAGGCCTCG CCCTCCAGCT CAAGGCGGGA CAGGTGCCCT 22 80 

AGAGTAGCCT GCATCCAGGG ACACACCACG TGGGTACCAA CATGTCCGGA GCCACATGGA 234 0 

CAGAGGCCGG CTCGGCCCAC CCTCTGCCCT GAGAGTGACC GCTGTACCAA CCTCTGTCCC 24 00 

TACAGGGCAG CCCCGAGAAC CACAGGTGTA CACCCTGCCC CCATCCCGGG ATGAGCTGAC 2460 

CAAGAACCAG GTCAGCCTGA CCTGCCTGGT CAAAGGCTTC TATCCCAGCG ACATCGCCGT 2520 

GGAGTGGGAG AGCAATGGGC AGCCGGAGAA CAACTACAAG ACCACGCCTC CCGTGCTGGA 2580 

CTCCGACGGC TCCTTCTTCC TCTACAGCAA GCTCACCGTG GACAAGAGCA GGTGGCAGCA 2640 

GGGGAACGTC TTCTCATGCT CCGTGATGCA TGAGGCTCTG CACAACCACT ACACGCAGAA 2700 

GAGCCTCTCC CTGTCTCCGG GTAAATGAGT GCGACGGCCG GCAAGCCCCC GCTCCCCGGG 2760 

CTCTCGCGGT CGCACGAGGA TGCTTGGCAC GTACCCCCTG TACATACTTC. CCGGGCGCCC 2820 

AGCATGGAAA TAAAGCACCC AGCGCTGCCC TGGGCCCCTG CGAGACTGTG ATGGTTCTTT 2880 

CCACGGGTCA GGCCGAGTCT GAGGCCTGAG TGGCATGAGG GAGGCAGAGC GGGTCCCACT 2940 

GTCCCCACAC TGGCCCAGGC TGTGCAGGTG TGCCTGGGCC CCCTAGGGTG GGGCTCAGCC 3 0 00 

AGGGGCTGCC CTCGGCAGGG TGGGGGATTT GCCAGCGTGG CCCTCCCTCC AGCAGCACCT 3 060 

GCCCTGGGCT GGGCCACGGG AAGCCCTAGG AGCCCCTGGG GACAGACACA CAGCCCCTGC 3120 

CTCTGTAGGA GACTGTCCTG TTCTGTGAGC GCCCCTGTCC TCCCGACCTC CATGCCCACT 3180 

CGGGGGCATG CCTAGTCCAT GTGCGTAGGG ACAGGCCCTC CCTCACCCAT CTACCCCCAC 3240 

GGCACTAACC CCTGGCTGCC CTGCCCAGCC TCGCACCCGC ATGGGGACAC AACCGACTCC 33 00 

GGGGACATGC ACTCTCGGGC CCTGTGGAGG GACTGGTGCA GATGCCCACA CACACACTCA 33 60 

GCCCAGACCC GTTCAACAAA CCCCGCACTG AGGTTGGCCG GCCACACGGC CACCACACAC 3420 

ACACGTGCAC GCCTCACACA CGGAGCCTCA CCCGGGCGAA CTGCACAGCA CCCAGACCAG 34 80 

AGCAAGGTCC TCGCACACGT GAACACTCCT CGGACACAGG CCCCCACGAG CCCCACGCGG 354 0 

CACCTCAAGG CCCACGAGCC TCTCGGCAGC TTCTCCACAT GCTGACCTGC TCAGACAAAC 3600 

CCAGCCCTCC TCTCACAAGG GTGCCCCTGC AGCCGCCACA CACACACAGG GGATCACACA 3660 

CCACGTCACG TCCCTGGCCC TGGCCCACTT CCCAGTGCCG CCCTTCCCTG CAGGACGGAT 3 720 

CAGCCTCGAC TGTGCCTTCT AGTTGCCAGC CATCTGTTGT TTGCCCCTCC CCCGTGCCTT 3 780 

CCTTGACCCT GGAAGGTGCC ACTCCCACTG TCCTTTCCTA ATAAAATGAG GAAATTGCAT 3 840 

CGCATTGTCT GAGTAGGTGT CATTCTATTC TGGGGGGTGG GGTGGGGCAG GACAGCAAGG 3 900 

GGGAGGATTG GGAAGACAAT AGCAGGCATG CTGGGGATGC GGTGGGCTCT ATGGCTTCTG 3 960 

AGGCGGAAAG AACCAGCTGG GGCTCTAGGG GGTATCCCCA CGCGCCCTGT AGCGGCGCAT 4 020 

TAAGCGCGGC GGGTGTGGTG GTTACGCGCA GCGTGACCGC TACACTTGCC AGCGCCCTAG 4 080 

CGCCCGCTCC TTTCGCTTTC TTCCCTTCCT TTCTCGCCAC GTTCGCCGGG CCTCTCAAAA 4140 

AAGGGAAAAA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC CTAACTCCGC 4200 

CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC TGACTAATTT 4260 

TTTTTATTTA TGCAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG AAGTAGTGAG 4320 

GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGG ACAGCTCAGG GCTGCGATTT 43 80 

i CGCGCCAAAC TTGACGGCAA TCCTAGCGTG AAGGCTGGTA GGATTTTATC CCCGCTGCCA 444 0 

TCATGGTTCG ACCATTGAAC TGCATCGTCG CCGTGTCCCA AAATATGGGG ATTGGCAAGA 4500 

ACGGAGACCT ACCCTGGCCT CCGCTCAGGA ACGAGTTCAA GTACTTCCAA AGAATGACCA 4560 

CAACCTCTTC AGTGGAAGGT AAACAGAATC TGGTGATTAT GGGTAGGAAA ACCTGGTTCT 4620 

CCATTCCTGA GAAGAATCGA CCTTTAAAGG ACAGAATTAA TATAGTTCTC AGTAGAGAAC 4680 

TCAAAGAACC ACCACGAGGA GCTCATTTTC TTGCCAAAAG TTTGGATGAT GCCTTAAGAC 4 74 0 

TTATTGAACA ACCGGAATTG GCAAGTAAAG TAGACATGGT TTGGATAGTC GGAGGCAGTT 4800 

CTGTTTACCA GGAAGCCATG AATCAACCAG GCCACCTTAG ACTCTTTGTG ACAAGGATCA 4860 

TGCAGGAATT TGAAAGTGAC ACGTTTTTCC CAGAAATTGA TTTGGGGAAA TATAAACTTC 4920 

TCCCAGAATA CCCAGGCGTC CTCTCTGAGG TCCAGGAGGA AAAAGGCATC AAGTATAAGT 4 980 

TTGAAGTCTA CGAGAAGAAA GACTAACAGG AAGATGCTTT CAAGTTCTCT GCTCCCCTCC 5040 

TAAAGCTATG CATTTTTATA AGACCATGGG ACTTTTGCTG GCTTTAGATC TCTTTGTGAA 5100 

GGAACCTTAC TTCTGTGGTG TGACATAATT GGACAAACTA CCTACAGAGA TTTAAAGCTC 5160 

TAAGGTAAAT ATAAAATTTT TAAGTGTATA ATGTGTTAAA CTACTGATTC TAATTGTTTG 5220 



TGTATTTTAG ATTCCAACCT ATGGAACTGA TGAATGGGAG CAGTGGTGGA ATGCCTTTAA 5280 

TGAGGAAAAC CTGTTTTGCT CAGAAGAAAT GCCATCTAGT GATGATGAGG CTACTGCTGA 534 0 

CTCTCAACAT TCTACTCCTC CAAAAAAGAA GAGAAAGGTA GAAGACCCCA AGGACTTTCC 54 00 

TTCAGAATTG CTAAGTTTTT TGAGTCATGC TGTGTTTAGT AATAGAACTC TTGCTTGCTT 5460 

TGCTATTTAC ACCACAAAGG AAAAAGCTGC ACTGCTATAC AAGAAAATTA TGGAAAAATA 552 0 

TTCTGTAACC TTTATAAGTA GGCATAACAG TTATAATCAT AACATACTGT TTTTTCTTAC 5580 

TCCACACAGG CATAGAGTGT CTGCTATTAA TAACTATGCT CAAAAATTGT GTACCTTTAG 5640 

CTTTTTAATT TGTAAAGGGG TTAATAAGGA ATATTTGATG TATAGTGCCT TGACTAGAGA 5700 

TCATAATCAG CCATACCACA TTTGTAGAGG TTTTACTTGC TTTAAAAAAC CTCCCACACC 5760 

TCCCCCTGAA CCTGAAACAT AAAATGAATG CAATTGTTGT TGTTAACTTG TTTATTGCAG 5820 

CTTATAATGG TTACAAATAA AGCAATAGCA TCACAAATTT CACAAATAAA GCATTTTTTT 5880 

CACTGCATTC TAGTTGTGGT TTGTCCAAAC TCATCAATGT ATCTTATCAT GTCTGGATCG 594 0 

GCTGGATGAT CCTCCAGCGC GGGGATCTCA TGCTGGAGTT CTTCGCCCAC CCCAACTTGT 6000 

TTATTGCAGC TTATAATGGT TACAAATAAA GCAATAGCAT CACAAATTTC ACAAATAAAG 6060 

CATTTTTTTC ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA TCTTATCATG 6120 

TCTGTATACC GTCGACCTCT AGCTAGAGCT TGGCGTAATC ATGGTCATAG CTGTTTCCTG 6180 

TGTGAAATTG TTATCCGCTC ACAATTCCAC ACAACATACG AGCCGGAAGC ATAAAGTGTA 624 0 

AAGCCTGGGG TGCCTAATGA GTGAGCTAAC TCACATTAAT TGCGTTGCGC TCACTGCCCG 63 00 

CTTTCCAGTC GGGAAACCTG TCGTGCCAGC TGCATTAATG AATCGGCCAA CGCGCGGGGA 63 60 

GAGGCGGTTT GCGTATTGGG CGCTCTTCCG CTTCCTCGCT CACTGACTCG CTGCGCTCGG 642 0 

TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG TTATCCACAG 64 8 0 

AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC 654 0 

GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA 660 0 

AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT 666 0 

TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC 672 0 

TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA ATGCTCACGC TGTAGGTATC 6780 

TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC 684 0 

CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT 6900 

TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG 6960 

CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA GTATTTGGTA 702 0 

TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA 708 0 

AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA 714 0 

AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG 7200 

AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC ACCTAGATCC 7260 

TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA ACTTGGTCTG 732 0 

ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA TTTCGTTCAT 73 8 0 

CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT ACGGGAGGGC TTACCATCTG 744 0 

GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA 750 0 

TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA 7560 

TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC 762 0 

GCAACGTTGT TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT 768 0 

CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA 774 0 

AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT 7800 

CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT 7860 

TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA 7920 

GTTGCTCTTG CCCGGCGTCA ATACGGGATA ATACCGCGCC ACATAGCAGA ACTTTAAAAG 7980 

TGCTCATCAT TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA 804 0 

GATCCAGTTC GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA 8100 

CCAGCGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG 8160 

CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC 822 0 

AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG 828 0 

GGGTTCCGCG CACATTTCCC CGAAAAGTGC CACCTGACGT C 8321 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGTACCAATT TAAATTGATA TCTCCTTAGG TCTCGAGCAC CATGAAGTTG CCTGTTAGGC 60 

TGTTGGTGCT GATGTTCTGG ATTCCTGCTT CCAGCAGTGA TGTTTTGATG ACCCAAATTC 120 

CAGTCTCCCT GCCTGTCAGT CTTGGAGATC AAGCGTCCAT CTCTTGCAGA TCTAGTCAGA 180 

TCATTGTACA TAATAATGGC AACACCTATT TAGAATGGTA CCTGCAGAAA CCAGGCCAGT 24 0, 

CTCCACAGCT CCTGATCTAC AAAGTTTCCA ACCGATTTTC TGGGGTCCCA GACAGGTTCA 3 00 

GCGGCAGTGG ATCAGGGACA GATTTCACAC TCAAGATCAG CAGAGTGGAG GCTGAGGATC 3 60 

TGGGAGTTTA TTACTGCTTT CAAGGTTCAC ATGTTCCATT CACGTTCGGC TCGGGGACAA 420 

AGTTGGAAAT AAAACGTAAG TCTCGAGTCT CTAGATAACC GGTCAATCGA TTGGAATTCT 4 80 

AAACTCTGAG GGGGTCGGAT GACGTGGCCA TTCTTTGCCT AAAGCATTGA GTTTACTGCA 540 

AGGTCAGAAA AGCATGCAAA GCCCTCAGAA TGGCTGCAAA GAGCTCCAAC AAAACAATTT 600 

AGAACTTTAT TAAGGAATAG GGGGAAGCTA GGAAGAAACT CAAAACATCA AGATTTTAAA 660 

TACGCTTCTT GGTCTCCTTG CTATAATTAT CTGGGATAAG CATGCTGTTT TCTGTCTGTC 720 

CCTAACATGC CCTTATCCGC AAACAACACA CCCAAGGGCA GAACTTTGTT ACTTAAACAC 780 

CATCCTGTTT GCTTCTTTCC TCAGGAACTG TGGCTGCACC ATCTGTCTTC ATCTTCCCGC 840 

CATCTGATGA GCAGTTGAAA TCTGGAACTG CCTCTGTTGT GTGCCTGCTG AATAACTTCT 900 

ATCCCAGAGA GGCCAAAGTA CAGTGGAAGG TGGATAACGC CCTCCAATCG GGTAACTCCC 960 

AGGAGAGTGT CACAGAGCAG GAGAGCAAGG ACAGCACCTA CAGCCTCAGC AGCACCCTGA 1020 

CGCTGAGCAA AGCAGACTAC GAGAAACACA AAGTCTACGC CTGCGAAGTC ACCCATCAGG 1080 

GCCTGAGCTC GCCCGTCACA AAGAGCTTCA ACAGGGGAGA GTGTTAGAGG GAGAAGTGCC 114 0 

CCCACCTGCT CCTCAGTTCC AGCCTGACCC CCTCCCATCC TTTGGCCTCT GACCCTTTTT 12 00 

CCACAGGGGA CCTACCCCTA TTGCGGTCCT CCAGCTCATC TTTCACCTCA CCCCCCTCCT 1260 

CCTCCTTGGC TTTAATTATG CTAATGTTGG AGGAGAATGA ATAAATAAAG TGAATCTTTG 1320 

CACCTGTGGT TTCTCTCTTT CCTCATTTAA TAATTATTAT CTGTTGTTTT ACCAACTACT 13 80 

CAATTTCTCT TATAAGGGAC TAAATATGTA GTCATCCTAA GGCACGTAAC CATTTATAAA 144 0 

AATCATCCTT CATTCTATTT TACCCTATCA TCCTCTGCAA GACAGTCCTC CCTCAAACCC 1500 

ACAAGCCTTC TGTCCTCACA GTCCCCTGGG CCATGGTAGG AGAGACTTGC TTCCTTGTTT ' 1560 

TCCCCTCCTC AGCAAGCCCT CATAGTCCTT TTTAAGGGTG ACAGGTCTTA CAGTCATATA 1620 

TCCTTTGATT CAATTCCCTG AGAATCAACC AAAGCAAATT TTTCAAAAGA AGAAACCTGC 1680 

TATAAAGAGA ATCATTCATT GCAACATGAT ATAAAATAAC AACACAATAA AAGCAATTAA 174 0 

ATAAACAAAC AATAGGGAAA TGTTTAAGTT CATCATGGTA CTTAGACTTA ATGGAATGTC 1800 

ATGCCTTATT TACATTTTTA AACAGGTACT GAGGGACTCC TGTCTGCCAA GGGCCGTATT 1860 

GAGTACTTTC CACAACCTAA TTTAATCCAC ACTATACTGT GAGATTAAAA ACATTCATTA 1920 

AAATGTTGCA AAGGTTCTAT AAAGCTGAGA GACAAATATA TTCTATAACT CAGCAATCCC 1980 

ACTTCTAGAT GACTGAGTGT CCCCACCCAC CAAAAAACTA TGCAAGAATG TTCAAAGCAG 204 0 

CTTTATTTAC AAAAGCCAAA AATTGGAAAT AGCCCGATTG TCCAACAATA GAATGAGTTA 2100 

TTAAACTGTG GTATGTTTAT ACATTAGAAT ACCCAATGAG GAGAATTAAC AAGCTACAAC 2160 

TATACCTACT CACACAGATG AATCTCATAA AAATAATGTT ACATAAGAGA AACTCAATGC 2220 

AAAAGATATG TTCTGTATGT TTTCATCCAT ATAAAGTTCA AAACCAGGTA AAAATAAAGT 2280 

TAGAAATTTG GATGGAAATT ACTCTTAGCT GGGGGTGGGC GAGTTAGTGC CTGGGAGAAG 2340 

ACAAGAAGGG GCTTCTGGGG TCTTGGTAAT GTTCTGTTCC TCGTGTGGGG TTGTGCAGTT 2400 

ATGATCTGTG CACTGTTCTG TATACACATT ATGCTTCAAA ATAACTTCAC ATAAAGAACA 24 60 

TCTTATACCC AGTTAATAGA TAGAAGAGGA ATAAGTAATA GGTCAAGACC AACGCAGCTG 2520 

GTAAGTGGGG GCCTGGGATC AAATAGCTAC CTGCCTAATC CTGCCCWCTT GAGCCCTGAA 2580 

TGAGTCTGCC TTCCAGGGCT CAAGGTGCTC AACAAAACAA CAGGCCTGCT ATTTTCCTGG 264 0 

CATCTGTGCC CTGTTTGGCT AGCTAGGAGC ACACATACAT AGAAATTAAA TGAAACAGAC 2700 

CTTCAGCAAG GGGACAGAGG ACAGAATTAA CCTTGCCCAG ACACTGGAAA CCCATGTATG 2760 

AACACTCACA TGTTTGGGAA GGGGGAAGGG CACATGTAAA TGAGGACTCT TCCTCATTCT 2820 

ATGGGGCACT CTGGCCCTGC CCCTCTCAGC TACTCATCCA TCCAACACAC CTTTCTAAGT 28 80 

ACCTCTCTCT GCCTACACTC TGAAGGGGTT CAGGAGTAAC TAACACAGCA TCCCTTCCCT 2940 

CAAATGACTG ACAATCCCTT TGTCCTGCTT TGTTTTTCTT TCCAGTCAGT ACTGGGAAAG 30 00 

TGGGGAAGGA CAGTCATGGA GAAACTACAT AAGGAAGCAC CTTGCCCTTC TGCCTCTTGA 3060 

GAATGTTGAT GAGTATCAAA TCTTTCAAAC TTTGGAGGTT TGAGTAGGGG TGAGACTCAG 3120 

TAATGTCCCT TCCAATGACA TGAACTTGCT CACTCATCCC TGGGGGCCAA ATTGAACAAT 3180 

CAAAGGCAGG CATAATCCAG TTATGAATTC TTGCGGCCGC TTGCTAGCTT CACGTGTTGG 3240 

ATCCAACCGC GGAAGGGCCC TATTCTATAG TGTCACCTAA ATGCTAGAGC TCGCTGATCA 33 00 



GCCTCGACTG TGCCTTCTAG TTGCCAGCCA TCTGTTGTTT GCCCCTCCCC CGTGCCTTCC 3360 

TTGACCCTGG AAGGTGCCAC TCCCACTGTC CTTTCCTAAT AAAATGAGGA AATTGCATCG 342 0 

CATTGTCTGA GTAGGTGTCA TTCTATTCTG GGGGGTGGGG TGGGGCAGGA CAGCAAGGGG 3480 

GAGGATTGGG AAGACAATAG CAGGCATGCT GGGGATGCGG TGGGCTCTAT GGCTTCTGAG 3540 

GCGGAAAGAA CCAGCTGGGG CTCTAGGGGG TATCCCCACG CGCCCTGTAG CGGCGCATTA 3600 

AGCGCGGCGG GTGTGGTGGT TACGCGCAGC GTGACCGCTA CACTTGCCAG CGCCCTAGCG 3 660 

CCCGCTCCTT TCGCTTTCTT CCCTTCCTTT CTCGCCACGT TCGCCGGGCC TCTCAAAAAA 3720 

GGGAAAAAAA GCATGCATCT CAATTAGTCA GCAACCATAG TCCCGCCCCT AACTCCGCCC 3780 

ATCCCGCCCC TAACTCCGCC CAGTTCCGCC CATTCTCCGC CCCATGGCTG ACTAATTTTT 3 84 0 

TTTATTTATG CAGAGGCCGA GGCCGCCTCG GCCTCTGAGC TATTCCAGAA GTAGTGAGGA 3 900 

GGCTTTTTTG GAGGCCTAGG CTTTTGCAAA AAGCTTGGAC AGCTCAGGGC TGCGATTTCG 3960 

CGCCAAACTT GACGGCAATC CTAGCGTGAA GGCTGGTAGG ATTTTATCCC CGCTGCCATC 4 02 0 

ATGGTTCGAC CATTGAACTG CATCGTCGCC GTGTCCCAAA ATATGGGGAT TGGCAAGAAC 4 080 

GGAGACCTAC CCTGGCCTCC GCTCAGGAAC GAGTTCAAGT ACTTCCAAAG AATGACCACA 4140 

ACCTCTTCAG TGGAAGGTAA ACAGAATCTG GTGATTATGG GTAGGAAAAC CTGGTTCTCC 4200 

ATTCCTGAGA AGAATCGACC TTTAAAGGAC AGAATTAATA TAGTTCTCAG TAGAGAACTC 4260 

AAAGAACCAC CACGAGGAGC TCATTTTCTT GCCAAAAGTT TGGATGATGC CTTAAGACTT 4320 

ATTGAACAAC CGGAATTGGC AAGTAAAGTA GACATGGTTT GGATAGTCGG AGGCAGTTCT 43 80 

GTTTACCAGG AAGCCATGAA TCAACCAGGC CACCTTAGAC TCTTTGTGAC AAGGATCATG 4440 

CAGGAATTTG AAAGTGACAC GTTTTTCCCA GAAATTGATT TGGGGAAATA TAAACTTCTC 4500 

CCAGAATACC CAGGCGTCCT CTCTGAGGTC CAGGAGGAAA AAGGCATCAA GTATAAGTTT 4560 

GAAGTCTACG AGAAGAAAGA CTAACAGGAA GATGCTTTCA AGTTCTCTGC TCCCCTCCTA 4620 

AAGCTATGCA TTTTTATAAG ACCATGGGAC TTTTGCTGGC TTTAGATCTC TTTGTGAAGG 4680 

AACCTTACTT CTGTGGTGTG ACATAATTGG ACAAACTACC TACAGAGATT TAAAGCTCTA 474 0 

AGGTAAATAT AAAATTTTTA AGTGTATAAT GTGTTAAACT ACTGATTCTA ATTGTTTGTG 4 800 

TATTTTAGAT TCCAACCTAT GGAACTGATG AATGGGAGCA GTGGTGGAAT GCCTTTAATG 4 86 0 

AGGAAAACCT GTTTTGCTCA GAAGAAATGC CATCTAGTGA TGATGAGGCT ACTGCTGACT 4 92 0 

CTCAACATTC TACTCCTCCA AAAAAGAAGA GAAAGGTAGA AGACCCCAAG GACTTTCCTT 4 98 0 

CAGAATTGCT AAGTTTTTTG AGTCATGCTG TGTTTAGTAA TAGAACTCTT GCTTGCTTTG 504 0 

CTATTTACAC CACAAAGGAA AAAGCTGCAC TGCTATACAA GAAAATTATG GAAAAATATT 510 0 

CTGTAACCTT TATAAGTAGG CATAACAGTT ATAATCATAA CATACTGTTT TTTCTTACTC 516 0 

CACACAGGCA TAGAGTGTCT GCTATTAATA ACTATGCTCA AAAATTGTGT ACCTTTAGCT 522 0 

TTTTAATTTG TAAAGGGGTT AATAAGGAAT ATTTGATGTA TAGTGCCTTG ACTAGAGATC 528 0 

ATAATCAGCC ATACCACATT TGTAGAGGTT TTACTTGCTT TAAAAAACCT CCCACACCTC 534 0 

CCCCTGAACC TGAAACATAA AATGAATGCA ATTGTTGTTG TTAACTTGTT TATTGCAGCT 540 0 

TATAATGGTT ACAAATAAAG CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA 546 0 

CTGCATTCTA GTTGTGGTTT GTCCAAACTC ATCAATGTAT CTTATGATGT CTGGATCGGC 552 0 

TGGATGATCC TCCAGCGCGG GGATCTCATG CTGGAGTTCT TCGCCCACCC CAACTTGTTT 558 0 

ATTGCAGCTT ATAATGGTTA CAAATAAAGC AATAGCATCA CAAATTTCAC AAATAAAGCA 564 0 

TTTTTTTCAC TGCATTCTAG TTGTGGTTTG TCCAAACTCA TCAATGTATC TTATCATGTC 5700 

TGTATACCGT CGACCTCTAG CTAGAGCTTG GCGTAATCAT GGTCATAGCT GTTTCCTGTG 576 0 

TGAAATTGTT ATCCGCTCAC AATTCCACAC AACATACGAG CCGGAAGCAT AAAGTGTAAA 582 0 

GCCTGGGGTG CCTAATGAGT GAGCTAACTC ACATTAATTG CGTTGCGCTC ' ACTGCCCGCT 5880 

TTCCAGTCGG GAAACCTGTC GTGCCAGCTG CATTAATGAA TCGGCCAACG CGCGGGGAGA 594 0 

GGCGGTTTGC GTATTGGGCG CTCTTCCGCT TCCTCGCTCA CTGACTCGCT GCGCTCGGTC 6000 

GTTCGGCTGC GGCGAGCGGT ATCAGCTCAC TCAAAGGCGG TAATACGGTT ATCCACAGAA 6060 

TCAGGGGATA ACGCAGGAAA GAACATGTGA GCAAAAGGCC AGCAAAAGGC CAGGAACCGT 6120 

AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA GCATCACAAA 6180 

AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA CCAGGCGTTT 624 0 

CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC TGCCGCTTAC CGGATACCTG 6300 

TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCAAT GCTCACGCTG TAGGTATCTC 6360 

AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC ACGAACCCCC CGTTCAGCCC 6420 

GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG ACACGACTTA 64 80 

TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT AGGCGGTGCT 6540 

ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA GAAGGACAGT ATTTGGTATC 6600 

TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG GTAGCTCTTG ATCCGGCAAA 6660 

CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC AGCAGATTAC GCGCAGAAAA 672 0 

AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGACGCTCA GTGGAACGAA 6 780 

AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC CTAGATCCTT 684 0 

TTAAATTAAA AATGAAGTTT TAAATCAATC TAAAGTATAT ATGAGTAAAC TTGGTCTGAC 6900 



AGTTACCAAT GCTTAATCAG TGAGGCACCT ATCTCAGCGA TCTGTCTATT TCGTTCATCC 6960 

ATAGTTGCCT GACTCCCCGT CGTGTAGATA ACTACGATAC GGGAGGGCTT ACCATCTGGC 7020 

CCCAGTGCTG CAATGATACC GCGAGACCCA CGCTCACCGG CTCCAGATTT ATCAGCAATA 7080 

AACCAGCCAG CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG CAACTTTATC CGCCTCCATC 714 0 

CAGTCTATTA ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA TAGTTTGCGC 72 00 

AACGTTGTTG CCATTGCTAC AGGCATCGTG GTGTCACGCT CGTCGTTTGG TATGGCTTCA 7260 

TTCAGCTCCG GTTCCCAACG ATCAAGGCGA GTTACATGAT CCCCCATGTT GTGCAAAAAA 73 20 

GCGGTTAGCT CCTTCGGTCC TCCGATCGTT GTCAGAAGTA AGTTGGCCGC AGTGTTATCA 73 80 

CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCGT AAGATGCTTT 7440 

TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT AGTGTATGCG GCGACCGAGT 7500 

TGCTCTTGCC CGGCGTCAAT ACGGGATAAT ACCGCGCCAC ATAGCAGAAC TTTAAAAGTG 7560 

CTCATCATTG GAAAACGTTC TTCGGGGCGA AAACTCTCAA GGATCTTACC GCTGTTGAGA 7620 

TCCAGTTCGA TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT TACTTTCACC 7680 

AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAAAGGG AATAAGGGCG 7740 

ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAAT ATTATTGAAG CATTTATCAG 7800 

GGTTATTGTC TCATGAGCGG ATACATATTT GAATGTATTT AGAAAAATAA ACAAATAGGG 7860 

GTTCCGCGCA CATTTCCCCG AAAAGTGCCA CCTGACGTCG ACGGATCGGG AGATCTGCTA 7920 

GCCCGGGTGA CCTGAGGCGC GCCGGCTTCG AATAGCCAGA GTAACCTTTT TTTTTAATTT 7980 

TATTTTATTT TATTTTTGAG ATGGAGTTTG GCGCCGATCT CCCGATCCCC TATGGTCGAC 8040 

TCTCAGTACA ATCTGCTCTG ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT 8100 

GTTGGAGGTC GCTGAGTAGT GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC 8160 

CGACAATTGC ATGAAGAATC TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG 8220 

GGCCAGATAT ACGCGTTGAC ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG 82 80 

GTCATTAGTT CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC 8340 

GCCTGGCTGA CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT 8400 

AGTAACGCCA ATAGGGACTT TCCATTGACG TCAATGGGTG GACTATTTAC GGTAAACTGC 8460 

CCACTTGGCA GTACATCAAG TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA 8520 

CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG 8580 

GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT 8640 

CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT 8700 

CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC 8760 

CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC 8820 

TCTCTGGCTA ACTAGAGAAC CCACTGCTTA CTGGCTTATC GAAATTAATA CGACTCACTA 8880 

TAGGGAGACC CAAGCTT 8897 

(2) INFORMATION FOR SEQ ID NO:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GGTACCAATT TAAATTGATA TCTCCTTAGG TCTCGAGTCT CTAGATAACC GGTCAATCGA 60 

TTGGAATTCT TGCGGCCGCT TGCTAGCCAC CATGGAGTTG TGGTTAAGCT TGGTCTTCCT 120 

TGTCCTTGTT TTAAAAGGTG TCCAGTGTGA AGTGCAACTG GTGGAGTCTG GGGGAGGCTT 180 

AGTGCAGCCT GGAGGGTCCC TGCGACTTTC CTGTGCTGCA TCTGGATTCC CGTTCAGTGA 240 

CTATTACATG TATTGGGTTC GCCAGGCTCC AGGCAAGGGA CTGGAGTGGG TCTCATACAT 3 00 

TAGTCAAGAT GGTGATATAA CCGACTATGC AGACTCCGTA AAGGGTCGAT TCACCATCTC 360 

CAGAGACAAT GCAAAGAACA GCCTGTACCT GCAAATGAAC AGCCTGAGGG ACGAGGACAC 420 

AGCCGTGTAT TACTGTGCAA GAGGCCTGGC GGACGGGGCC TGGTTTGCTT ACTGGGGCCA 4 80 

AGGGACTCTG GTCACGGTCT CTTCCGCTAG CACCAAGGGC CCATCGGTCT TCCCCCTGGC 540 

ACCCTCCTCC AAGAGCACCT CTGGGGGCAC AGCGGCCCTG GGCTGCCTGG TCAAGGACTA 600 

CTTCCCCGAA CCGGTGACGG TGTCGTGGAA CTCAGGCGCC CTGACCAGCG GCGTGCACAC 660 

CTTCCCGGCT GTCCTACAGT CCTCAGGACT CTACTCCCTC AGCAGCGTGG TCACCGTGCC 72 0 

CTCCAGCAGC TTGGGCACCC AGACCTACAT CTGCAACGTG AATCACAAGC CCAGCAACAC 780 



CAAGGTGGAC AAGAAAGTTG GTGAGAGGCC AGCACAGGGA GGGAGGGTGT CTGCTGGAAG 84 0 

CCAGGCTCAG CGCTCCTGCC TGGACGCATC CCGGCTATGC AGCCCCAGTC CAGGGCAGCA 900 

AGGCAGGCCC CGTCTGCCTC TTCACCCGGA GGCCTCTGCC CGCCCCACTC ATGCTCAGGG 960 

AGAGGGTCTT CTGGCTTTTT CCCCAGGCTC TGGGCAGGCA CAGGCTAGGT GCCCCTAACC 1020 

CAGGCCCTGC ACACAAAGGG GCAGGTGCTG GGCTCAGACC TGCCAAGAGC CATATCCGGG 1080 

AGGACCCTGC CCCTGACCTA AGCCCACCCC AAAGGCCAAA CTCTCCACTC CCTCAGCTCG 1140 

GACACCTTCT CTCCTCCCAG ATTCCAGTAA CTCCCAATCT TCTCTCTGCA GAGCCCAAAT 12 00 

CTTGTGACAA AACTCACACA TGCCCACCGT GCCCAGGTAA GCCAGCCCAG GCCTCGCCCT 1260 

CCAGCTCAAG GCGGGACAGG TGCCCTAGAG TAGCCTGCAT CCAGGGACAC ACCACGTGGG 1320 

TACCAACATG TCCGGAGCCA CATGGACAGA GGCCGGCTCG GCCCACCCTC TGCCCTGAGA 13 80 

GTGACCGCTG TACCAACCTC TGTCCCTACA GGGCAGCCCC GAGAACCACA GGTGTACACC 1440 

CTGCCCCCAT CCCGGGATGA GCTGACCAAG AACCAGGTCA GCCTGACCTG CCTGGTCAAA 1500 

GGCTTCTATC CCAGCGACAT CGCCGTGGAG TGGGAGAGCA ATGGGCAGCC GGAGAACAAC 1560 

TACAAGACCA CGCCTCCCGT GCTGGACTCC GACGGCTCCT TCTTCCTCTA CAGCAAGCTC 1620 

ACCGTGGACA AGAGCAGGTG GCAGCAGGGG AACGTCTTCT CATGCTCCGT GATGCATGAG 1680 

GCTCTGCACA ACCACTACAC GCAGAAGAGC CTCTCCCTGT CTCCGGGTAA ATGAGTGCGA 174 0 

CGGCCGGCAA GCCCCCGCTC CCCGGGCTCT CGCGGTCGCA CGAGGATGCT TGGCACGTAC 1800 

CCCCTGTACA TACTTCCCGG GCGCCCAGCA TGGAAATAAA GCACCCAGCG CTGCCCTGGG 1860 

CCCCTGCGAG ACTGTGATGG TTCTTTCCAC GGGTCAGGCC GAGTCTGAGG CCTGAGTGGC 1920 

ATGAGGGAGG CAGAGCGGGT CCCACTGTCC CCACACTGGC CCAGGCTGTG CAGGTGTGCC 1980 

TGGGCCCCCT AGGGTGGGGC TCAGCCAGGG GCTGCCCTCG GCAGGGTGGG GGATTTGCCA 204 0 

GCGTGGCCCT CCCTCCAGCA GCACCTGCCC TGGGCTGGGC CACGGGAAGC CCTAGGAGCC 2100 

CCTGGGGACA GACACACAGC CCCTGCCTCT GTAGGAGACT GTCCTGTTCT GTGAGCGCCC 2160 

CTGTCCTCCC GACCTCCATG CCCACTCGGG GGCATGCCTA GTCCATGTGC GTAGGGACAG 2220 

GCCCTCCCTC ACCCATCTAC CCCCACGGCA CTAACCCCTG GCTGCCCTGC CCAGCCTCGC 2280 

ACCCGCATGG GGACACAACC GACTCCGGGG ACATGCACTC TCGGGCCCTG TGGAGGGACT 234 0 

GGTGCAGATG CCCACACACA CACTCAGCCC AGACCCGTTC AACAAACCCC GCACTGAGGT 24 00 

TGGCCGGCCA CACGGCCACC ACACACACAC GTGCACGCCT CACACACGGA GCCTCACCCG 2460 

GGCGAACTGC ACAGCACCCA GACCAGAGCA AGGTCCTCGC ACACGTGAAC ACTCCTCGGA 2520 

CACAGGCCCC CACGAGCCCC ACGCGGCACC TCAAGGCCCA CGAGCCTCTC GGCAGCTTCT 2580 

CCACATGCTG ACCTGCTCAG ACAAACCCAG CCCTCCTCTC ACAAGGGTGC CCCTGCAGCC 264 0 

GCCACACACA CACAGGGGAT CACACACCAC GTCACGTCCC TGGCCCTGGC CCACTTCCCA 2700 

GTGCCGCCCT TCCCTGCAGG ACGGATCAGC CTCGACTGTG CCTTCTAGTT GCCAGCCATC 2760 

TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC CCACTGTCCT 282 0 

TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT CTATTCTGGG 28 80 

GGGTGGGGTG GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCA GGCATGCTGG 2940 

GGATGCGGTG GGCTCTATGG CTTCTGAGGC GGAAAGAACC AGCTGGGGCT CTAGGGGGTA 3 000 

TCCCCACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT GTGGTGGTTA CGCGCAGCGT 3060 

GACCG CTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC CTTCCTTTCT 3120 

CGCCACGTTC GCCGGGCCTC TCAAAAAAGG GAAAAAAAGC ATGCATCTCA ATTAGTCAGC 3180 

AACCATAGTC CCGCCCCTAA CTCCGCCCAT CCCGCCCCTA ACTCCGCCCA GTTCCGCCCA 324 0 

TTCTCCGCCC CATGGCTGAC TAATTTTTTT TATTTATGCA GAGGCCGAGG CCGCCTCGGC 33 00 

CTCTGAGCTA TTCCAGAAGT AGTGAGGAGG CTTTTTTGGA GGCCTAGGCT TTTGCAAAAA 3360 

GCTTGGACAG CTCAGGGCTG CGATTTCGCG CCAAACTTGA CGGCAATCCT AGCGTGAAGG 3420 

CTGGTAGGAT TTTATCCCCG CTGCCATCAT GGTTCGACCA TTGAACTGCA TCGTCGCCGT 34 80 

GTCCCAAAAT ATGGGGATTG GCAAGAACGG AGACCTACCC TGGCCTCCGC TCAGGAACGA 3540 

GTTCAAGTAC TTCCAAAGAA TGACCACAAC CTCTTCAGTG GAAGGTAAAC AGAATCTGGT 3600 

GATTATGGGT AGGAAAACCT GGTTCTCCAT TCCTGAGAAG AATCGACCTT TAAAGGACAG 3660 

AATTAATATA GTTCTCAGTA GAGAACTCAA AGAACCACCA CGAGGAGCTC ATTTTCTTGC 3720 

CAAAAGTTTG GATGATGCCT TAAGACTTAT TGAACAACCG GAATTGGCAA GTAAAGTAGA 37 80 

CATGGTTTGG ATAGTCGGAG GCAGTTCTGT TTACCAGGAA GCCATGAATC AACCAGGCCA 3 840 

CCTTAGACTC TTTGTGACAA GGATCATGCA GGAATTTGAA AGTGACACGT TTTTCCCAGA 3 900 

AATTGATTTG GGGAAATATA AACTTCTCCC AGAATACCCA GGCGTCCTCT CTGAGGTCCA 3 960 

GGAGGAAAAA GGCATCAAGT ATAAGTTTGA AGTCTACGAG AAGAAAGACT AACAGGAAGA 4020 

TGCTTTCAAG TTCTCTGCTC CCCTCCTAAA GCTATGCATT TTTATAAGAC CATGGGACTT 4080 

TTGCTGGCTT TAGATCTCTT TGTGAAGGAA CCTTACTTCT GTGGTGTGAC ATAATTGGAC 4140 

AAACTACCTA CAGAGATTTA AAGCTCTAAG GTAAATATAA AATTTTTAAG TGTATAATGT 42 00 

GTTAAACTAC TGATTCTAAT TGTTTGTGTA TTTTAGATTC CAACCTATGG AACTGATGAA 4260 

TGGGAGCAGT GGTGGAATGC CTTTAATGAG GAAAACCTGT TTTGCTCAGA AGAAATGCCA 43 20 

TCTAGTGATG ATGAGGCTAC TGCTGACTCT CAACATTCTA CTCCTCCAAA AAAGAAGAGA 43 80 



AAGGTAGAAG ACCCCAAGGA CTTTCCTTCA GAATTGCTAA GTTTTTTGAG TCATGCTGTG 444 0 

TTTAGTAATA GAACTCTTGC TTGCTTTGCT ATTTACACCA CAAAGGAAAA AGCTGCACTG 4500 

CTATACAAGA AAATTATGGA AAAATATTCT GTAACCTTTA TAAGTAGGCA TAACAGTTAT 4 560 

AATCATAACA TACTGTTTTT TCTTACTCCA CACAGGCATA GAGTGTCTGC TATTAATAAC 4 620 

TATGCTCAAA AATTGTGTAC CTTTAGCTTT TTAATTTGTA AAGGGGTTAA TAAGGAATAT 4 680 

TTGATGTATA GTGCCTTGAC TAGAGATCAT AATCAGCCAT ACCACATTTG TAGAGGTTTT 4 74 0 

ACTTGCTTTA AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA TGAATGCAAT 4 800 

TGTTGTTGTT AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA ATAGCATCAC 4 860 

AAATTTCACA AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT CCAAACTCAT 4 92 0 

CAATGTATCT TATCATGTCT GGATCGGCTG GATGATCCTC CAGCGCGGGG ATCTCATGCT 4 98 0 

GGAGTTCTTC GCCCACCCCA ACTTGTTTAT . TGCAGCTTAT AATGGTTACA AATAAAGCAA 504 0 

TAGCATCACA AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT GTGGTTTGTC 5100 

CAAACTCATC AATGTATCTT ATCATGTCTG TATACCGTCG ACCTCTAGCT AGAGCTTGGC 5160 

GTAATCATGG TCATAGCTGT TTCCTGTGTG AAATTGTTAT CCGCTCACAA TTCCACACAA 5220 

CATACGAGCC GGAAGCATAA AGTGTAAAGC CTGGGGTGCC TAATGAGTGA GCTAACTCAC 52 80 

ATTAATTGCG TTGCGCTCAC TGCCCGCTTT CCAGTCGGGA AACCTGTCGT GCCAGCTGCA 534 0 

TTAATGAATC GGCCAACGCG CGGGGAGAGG CGGTTTGCGT ATTGGGCGCT CTTCCGCTTC 54 00 

CTCGCTCACT GACTCGCTGC GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT CAGCTCACTC 5460 

AAAGGCGGTA ATACGGTTAT CCACAGAATC AGGGGATAAC GCAGGAAAGA ACATGTGAGC 552 0 

AAAAGGCCAG CAAAAGGCCA GGAACCGTAA AAAGGCCGCG TTGCTGGCGT TTTTCCATAG 558 0 

GCTCCGCCCC CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC 564 0 

GACAGGACTA TAAAGATACC AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT 5700 

TCCGACCCTG CCGCTTACCG GATACCTGTC CGCCTTTCTC CCTTCGGGAA GCGTGGCGCT 5760 

TTCTCAATGC TCACGCTGTA GGTATCTCAG TTCGGTGTAG GTCGTTCGCT CCAAGCTGGG 582 0 

CTGTGTGCAC GAACCCCCCG TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT 5880 

TGAGTCCAAC CCGGTAAGAC ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT 5940 

TAGCAGAGCG AGGTATGTAG GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC CTAACTACGG 6000 

CTACACTAGA AGGACAGTAT TTGGTATCTG CGCTCTGCTG AAGCCAGTTA CCTTCGGAAA 6060 

AAGAGTTGGT AGCTCTTGAT CCGGCAAACA AACCACCGCT GGTAGCGGTG GTTTTTTTGT 612 0 

TTGCAAGCAG CAGATTACGC GCAGAAAAAA AGGATCTCAA GAAGATCCTT TGATCTTTTC 6180 

TACGGGGTCT GACGCTCAGT GGAACGAAAA CTCACGTTAA GGGATTTTGG TCATGAGATT 6240 

ATCAAAAAGG ATCTTCACCT AGATCCTTTT AAATTAAAAA TGAAGTTTTA AATCAATCTA 63 00 

AAGTATATAT GAGTAAACTT GGTCTGACAG TTACCAATGC TTAATCAGTG AGGCACCTAT 63 60 

CTCAGCGATC TGTCTATTTC GTTCATCCAT AGTTGCCTGA CTCCCCGTCG TGTAGATAAC 6420 

TACGATACGG GAGGGCTTAC CATCTGGCCC CAGTGCTGCA ATGATACCGC GAGACCCACG 64 80 

CTCACCGGCT CCAGATTTAT CAGCAATAAA CCAGCCAGCC GGAAGGGCCG AGCGCAGAAG 6540 

TGGTCCTGCA ACTTTATCCG CCTCCATCCA GTCTATTAAT TGTTGCCGGG AAGCTAGAGT 66 00 

AAGTAGTTCG CCAGTTAATA GTTTGCGCAA CGTTGTTGCC ATTGCTACAG GCATCGTGGT 6660 

GTCACGCTCG TCGTTTGGTA TGGCTTCATT CAGCTCCGGT TCCCAACGAT CAAGGCGAGT 6720 

TACATGATCC CCCATGTTGT GCAAAAAAGC GGTTAGCTCC TTCGGTCCTC CGATCGTTGT 6780 

CAGAAGTAAG TTGGCCGCAG TGTTATCACT CATGGTTATG GCAGCACTGC ATAATTCTCT 6840 

TACTGTCATG CCATCCGTAA GATGCTTTTC TGTGACTGGT GAGTACTCAA CCAAGTCATT 6900 

CTGAGAATAG TGTATGCGGC GACCGAGTTG CTCTTGCCCG GCGTCAATAC GGGATAATAC 6960 

CGCGCCACAT AGCAGAACTT TAAAAGTGCT CATCATTGGA AAACGTTCTT CGGGGCGAAA 702 0 

ACTCTCAAGG ATCTTACCGC TGTTGAGATC CAGTTCGATG TAACCCACTC GTGCACCCAA 7080 

CTGATCTTCA GCATCTTTTA CTTTCACCAG CGTTTCTGGG TGAGCAAAAA CAGGAAGGCA 714 0 

AAATGCCGCA AAAAAGGGAA TAAGGGCGAC ACGGAAATGT TGAATACTCA TACTCTTCCT 7200 

TTTTCAATAT TATTGAAGCA TTTATCAGGG TTATTGTCTC ATGAGCGGAT ACATATTTGA 7260 

ATGTATTTAG AAAAATAAAC AAATAGGGGT TCCGCGCACA TTTCCCCGAA AAGTGCCACC 7320 

TGACGTCGAC GGATCGGGAG ATCTGCTAGG TGACCTGAGG CGCGCCGGCT TCGAATAGCC 73 80 

AGAGTAACCT TTTTTTTTAA TTTTATTTTA TTTTATTTTT GAGATGGAGT TTGGCGCCGA 744 0 

TCTCCCGATC CCCTATGGTC GACTCTCAGT ACAATCTGCT CTGATGCCGC ATAGTTAAGC 7500 

CAGTATCTGC TCCCTGCTTG TGTGTTGGAG GTCGCTGAGT AGTGCGCGAG CAAAATTTAA 7560 

GCTACAACAA GGCAAGGCTT GACCGACAAT TGCATGAAGA ATCTGCTTAG GGTTAGGCGT 7620 

TTTGCGCTGC TTCGCGATGT ACGGGCCAGA TATACGCGTT GACATTGATT ATTGACTAGT 7680 

TATTAATAGT AATCAATTAC GGGGTCATTA GTTCATAGCC CATATATGGA GTTCCGCGTT 774 0 

ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG CCCATTGACG 7800 

TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA CTTTCCATTG ACGTCAATGG 7860 

GTGGACTATT TACGGTAAAC TGCCCACTTG GCAGTACATC AAGTGTATCA TATGCCAAGT 7920 

ACGCCCCCTA TTGACGTCAA TGACGGTAAA TGGCCCGCCT GGCATTATGC CCAGTACATG 7980 



ACCTTATGGG ACTTTCCTAC TTGGCAGTAC ATCTACGTAT TAGTCATCGC TATTACCATG 
GTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATAGC GGTTTGACTC ACGGGGATTT 
CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT GGCACCAAAA TCAACGGGAC 
TTTCCAAAAT GTCGTAACAA CTCCGCCCCA TTGACGCAAA TGGGCGGTAG GCGTGTACGG 
TGGGAGGTCT ATATAAGCAG AGCTCTCTGG CTAACTAGAG AACCCACTGC TTACTGGCTT 
ATCGAAATTA ATACGACTCA CTATAGGGAG ACCCAAGCTT G 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8897 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GACGGATCGG GAGATCTGCT AGCCCGGGTG ACCTGAGGCG CGCCGGCTTC GAATAGCCAG 60 

AGTAACCTTT TTTTTTAATT TTATTTTATT TTATTTTTGA GATGGAGTTT GGCGCCGATC 120 

TCCCGATCCC CTATGGTCGA CTCTCAGTAC AATCTGCTCT GATGCCGCAT AGTTAAGCCA 180 

GTATCTGCTC CCTGCTTGTG TGTTGGAGGT CGCTGAGTAG TGCGCGAGCA AAATTTAAGC 24 0 

TACAACAAGG CAAGGCTTGA CCGACAATTG CATGAAGAAT CTGCTTAGGG TTAGGCGTTT 3 00 

TGCGCTGCTT CGCGATGTAC GGGCCAGATA TACGCGTTGA CATTGATTAT TGACTAGTTA 3 60 

TTAATAGTAA TCAATTACGG GGTCATTAGT TCATAGCCCA TATATGGAGT TCCGCGTTAC 420 

ATAACTTACG GTAAATGGCC CGCCTGGCTG ACCGCCCAAC GACCCCCGCC CATTGACGTC 4 80 

AATAATGACG TATGTTCCCA TAGTAACGCC AATAGGGACT TTCCATTGAC GTCAATGGGT 540 

GGACTATTTA CGGTAAACTG CCCACTTGGC AGTACATCAA GTGTATCATA TGCCAAGTAC 600 

GCCCCCTATT GACGTCAATG ACGGTAAATG GCCCGCCTGG CATTATGCCC AGTACATGAC 660 

CTTATGGGAC TTTCCTACTT GGCAGTACAT CTACGTATTA GTCATCGCTA TTACCATGGT 720 

GATGCGGTTT TGGCAGTACA TCAATGGGCG TGGATAGCGG TTTGACTCAC GGGGATTTCC 780 

AAGTCTCCAC CCCATTGACG TCAATGGGAG TTTGTTTTGG CACCAAAATC AACGGGACTT 84 0 

TCCAAAATGT CGTAACAACT CCGCCCCATT GACGCAAATG GGCGGTAGGC GTGTACGGTG 900 

GGAGGTCTAT ATAAGCAGAG CTCTCTGGCT AACTAGAGAA CCCACTGCTT ACTGGCTTAT 960 

CGAAATTAAT ACGACTCACT ATAGGGAGAC CCAAGCTTGG TACCAATTTA AATTGATATC 1020 

TCCTTAGGTC TCGAGCACCA TGAAGTTGCC TGTTAGGCTG TTGGTGCTGA TGTTCTGGAT 10 80 

TCCTGCTTCC AGCAGTGATG TTGTCATGAC CCAAACCCCA CTGTCCAGTC CTGTCACGCT 1140 

TGGACAACCT GCGTCCATCT CTTGCAGATC TAGTCAGATC ATTGTACATA ATAATGGCAA 12 00 

CACCTATCTG GAATGGTACC AGCAGAGACC AGGGCAGTCT CCACGGCTCC TGATCTACAA 1260 

AGTTTCCAAC CGATTTTCTG GGGTCCCAGA CAGGTTCAGC GGCAGTGGAG CTGGGACAGA 13 20 

TTTCACACTC AAGATCAGCA GAGTGGAGGC TGAGGATGTG GGAGTTTACT ACTGCTTCCA 13 80 

GGGTTCACAT GTTCCATTCA CGTTCGGCCA AGGGACAAAG TTGGAAATCA AACGTAAGTC 1440 

TCGAGTCTCT AGATAACCGG TCAATCGATT GGAATTCTAA ACTCTGAGGG GGTCGGATGA 1500 

CGTGGCCATT CTTTGCCTAA AGCATTGAGT TTACTGCAAG GTCAGAAAAG CATGCAAAGC 1560 

CCTCAGAATG GCTGCAAAGA GCTCCAACAA AACAATTTAG AACTTTATTA AGGAATAGGG 1620 

GGAAGCTAGG AAGAAACTCA AAACATCAAG ATTTTAAATA CGCTTCTTGG TCTCCTTGCT 1680 

ATAATTATCT GGGATAAGCA TGCTGTTTTC TGTCTGTCCC TAACATGCCC TTATCCGCAA 174 0 

ACAACACACC CAAGGGCAGA ACTTTGTTAC TTAAACACCA TCCTGTTTGC TTCTTTCCTC 1800 

AGGAACTGTG GCTGCACCAT CTGTCTTCAT CTTCCCGCCA TCTGATGAGC AGTTGAAATC 1860 

TGGAACTGCC TCTGTTGTGT GCCTGCTGAA TAACTTCTAT CCCAGAGAGG CCAAAGTACA 1920 

GTGGAAGGTG GATAACGCCC TCCAATCGGG TAACTCCCAG GAGAGTGTCA CAGAGCAGGA 1980 

GAGCAAGGAC AGCACCTACA GCCTCAGCAG CACCCTGACG CTGAGCAAAG CAGACTACGA 204 0 

GAAACACAAA GTCTACGCCT GCGAAGTCAC CCATCAGGGC CTGAGCTCGC CCGTCACAAA 2100 

GAGCTTCAAC AGGGGAGAGT GTTAGAGGGA GAAGTGCCCC CACCTGCTCC TCAGTTCCAG 2160 

CCTGACCCCC TCCCATCCTT TGGCCTCTGA CCCTTTTTCC ACAGGGGACC TACCCCTATT 2220 

GCGGTCCTCC AGCTCATCTT TCACCTCACC CCCCTCCTCC TCCTTGGCTT TAATTATGCT 22 80 

AATGTTGGAG GAGAATGAAT AAATAAAGTG AATCTTTGCA CCTGTGGTTT CTCTCTTTCC 2340 

TCATTTAATA ATTATTATCT GTTGTTTTAC CAACTACTCA ATTTCTCTTA TAAGGGACTA 24 00 

AATATGTAGT CATCCTAAGG CACGTAACCA TTTATAAAAA TCATCCTTCA TTCTATTTTA 24 60 



8040 
8100 
8160 
8220 
8280 
8321 



CCCTATCATC CTCTGCAAGA CAGTCCTCCC TCAAACCCAC AAGCCTTCTG TCCTCACAGT 2520 

CCCCTGGGCC ATGGTAGGAG AGACTTGCTT CCTTGTTTTC CCCTCCTCAG CAAGCCCTCA 2580 

TAGTCCTTTT TAAGGGTGAC AGGTCTTACA GTCATATATC CTTTGATTCA ATTCCCTGAG 2640 

AATCAACCAA AGCAAATTTT TCAAAAGAAG AAACCTGCTA TAAAGAGAAT CATTCATTGC 2700 

AACATGATAT AAAATAACAA CACAATAAAA GCAATTAAAT AAACAAACAA TAGGGAAATG 2760 

TTTAAGTTCA TCATGGTACT TAGACTTAAT GGAATGTCAT GCCTTATTTA CATTTTTAAA 2820 

CAGGTACTGA GGGACTCCTG TCTGCCAAGG GCCGTATTGA GTACTTTCCA CAACCTAATT 2880 

TAATCCACAC TATACTGTGA GATTAAAAAC ATTCATTAAA ATGTTGCAAA GGTTCTATAA 2940 

AGCTGAGAGA CAAATATATT CTATAACTCA GCAATCCCAC TTCTAGATGA CTGAGTGTCC 3 000 

CCACCCACCA AAAAACTATG CAAGAATGTT CAAAGCAGCT TTATTTACAA AAGCCAAAAA 3 060 

TTGGAAATAG CCCGATTGTC CAACAATAGA ATGAGTTATT AAACTGTGGT ATGTTTATAC 3120 

ATTAGAATAC CCAATGAGGA GAATTAACAA GCTACAACTA TACCTACTCA CACAGATGAA 3180 

TCTCATAAAA ATAATGTTAC ATAAGAGAAA CTCAATGCAA AAGATATGTT CTGTATGTTT 324 0 

TCATCCATAT AAAGTTCAAA ACCAGGTAAA AATAAAGTTA GAAATTTGGA TGGAAATTAC 33 00 

TCTTAGCTGG GGGTGGGCGA GTTAGTGCCT GGGAGAAGAC AAGAAGGGGC TTCTGGGGTC 33 60 

TTGGTAATGT TCTGTTCCTC GTGTGGGGTT GTGCAGTTAT GATCTGTGCA CTGTTCTGTA 34 20 

TACACATTAT GCTTCAAAAT AACTTCACAT AAAGAACATC TTATACCCAG TTAATAGATA 34 80 

GAAGAGGAAT AAGTAATAGG TCAAGACCAA CGCAGCTGGT AAGTGGGGGC CTGGGATCAA 3540 

ATAGCTACCT GCCTAATCCT GCCCWCTTGA GCCCTGAATG AGTCTGCCTT CCAGGGCTCA 3600 

AGGTGCTCAA CAAAACAACA GGCCTGCTAT TTTCCTGGCA TCTGTGCCCT GTTTGGCTAG 3660 

CTAGGAGCAC ACATACATAG AAATTAAATG AAACAGACCT TCAGCAAGGG GACAGAGGAC 3720 

AGAATTAACC TTGCCCAGAC ACTGGAAACC CATGTATGAA CACTCACATG TTTGGGAAGG 37 80 

GGGAAGGGCA CATGTAAATG AGGACTCTTC CTCATTCTAT GGGGCACTCT GGCCCTGCCC 3840 

CTCTCAGCTA CTCATCCATC CAACACACCT TTCTAAGTAC CTCTCTCTGC CTACACTCTG 3900 

AAGGGGTTCA GGAGTAACTA ACACAGCATC CCTTCCCTCA AATGACTGAC AATCCCTTTG 3960 

TCCTGCTTTG TTTTTCTTTC CAGTCAGTAC TGGGAAAGTG GGGAAGGACA GTCATGGAGA 4 020 

AACTACATAA GGAAGCACCT TGCCCTTCTG CCTCTTGAGA ATGTTGATGA GTATCAAATC 40 80 

TTTCAAACTT TGGAGGTTTG AGTAGGGGTG AGACTCAGTA ATGTCCCTTC CAATGACATG 4140 

AACTTGCTCA CTCATCCCTG GGGGCCAAAT TGAACAATCA AAGGCAGGCA TAATCCAGTT 42 00 

ATGAATTCTT GCGGCCGCTT GCTAGCTTCA CGTGTTGGAT CCAACCGCGG AAGGGCCCTA 4260 

TTCTATAGTG TCACCTAAAT GCTAGAGCTC GCTGATCAGC CTCGACTGTG CCTTCTAGTT 43 20 

GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC 43 80 

CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 4440 

CTATTCTGGG GGGTGGGGTG GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCA 4500 

GGCATGCTGG GGATGCGGTG GGCTCTATGG CTTCTGAGGC GGAAAGAACC AGCTGGGGCT 4560 

CTAGGGGGTA TCCCCACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT GTGGTGGTTA 4620 

CGCGCAGCGT GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC 46 80 

CTTCCTTTCT CGCCACGTTC GCCGGGCCTC TCAAAAAAGG GAAAAAAAGC ATGCATCTCA 4 74 0 

ATTAGTCAGC AACCATAGTC CCGCCCCTAA CTCCGCCCAT CCCGCCCCTA ACTCCGCCCA 4 800 

GTTCCGCCCA TTCTCCGCCC CATGGCTGAC TAATTTTTTT TATTTATGCA GAGGCCGAGG 4860 

CCGCCTCGGC CTCTGAGCTA TTCCAGAAGT AGTGAGGAGG CTTTTTTGGA GGCCTAGGCT 4920 

TTTGCAAAAA GCTTGGACAG CTCAGGGCTG CGATTTCGCG CCAAACTTGA CGGCAATCCT 4 980 

AGCGTGAAGG CTGGTAGGAT TTTATCCCCG CTGCCATCAT GGTTCGACCA TTGAACTGCA 504 0 

TCGTCGCCGT GTCCCAAAAT ATGGGGATTG GCAAGAACGG AGACCTACCC TGGCCTCCGC 5100 

TCAGGAACGA GTTCAAGTAC TTCCAAAGAA TGACCACAAC CTCTTCAGTG GAAGGTAAAC 5160 

AGAATCTGGT GATTATGGGT AGGAAAACCT GGTTCTCCAT TCCTGAGAAG AATCGACCTT 5220 

TAAAGGACAG AATTAATATA GTTCTCAGTA GAGAACTCAA AGAACCACCA CGAGGAGCTC 52 80 

ATTTTCTTGC CAAAAGTTTG GATGATGCCT TAAGACTTAT TGAACAACCG GAATTGGCAA 5340 

GTAAAGTAGA CATGGTTTGG ATAGTCGGAG GCAGTTCTGT TTACCAGGAA GCCATGAATC 54 00 

AACCAGGCCA CCTTAGACTC TTTGTGACAA GGATCATGCA GGAATTTGAA AGTGACACGT 54 60 

TTTTCCCAGA AATTGATTTG GGGAAATATA AACTTCTCCC AGAATACCCA GGCGTCCTCT 5520 

CTGAGGTCCA GGAGGAAAAA GGCATCAAGT ATAAGTTTGA AGTCTACGAG AAGAAAGACT 5580 

AACAGGAAGA TGCTTTCAAG TTCTCTGCTC CCCTCCTAAA GCTATGCATT TTTATAAGAC 5640 

CATGGGACTT TTGCTGGCTT ■ TAGATCTCTT TGTGAAGGAA CCTTACTTCT GTGGTGTGAC 5700 

ATAATTGGAC AAACTACCTA CAGAGATTTA AAGCTCTAAG GTAAATATAA AATTTTTAAG 5760 

TGTATAATGT GTTAAACTAC TGATTCTAAT TGTTTGTGTA TTTTAGATTC CAACCTATGG 5820 

AACTGATGAA TGGGAGCAGT GGTGGAATGC CTTTAATGAG GAAAACCTGT TTTGCTCAGA 58 80 

AGAAATGCCA TCTAGTGATG ATGAGGCTAC TGCTGACTCT CAACATTCTA CTCCTCCAAA 5940 

AAAGAAGAGA AAGGTAGAAG ACCCCAAGGA CTTTCCTTCA GAATTGCTAA GTTTTTTGAG 6000 

TCATGCTGTG TTTAGTAATA GAACTCTTGC TTGCTTTGCT ATTTACACCA CAAAGGAAAA 6060 



AGCTGCACTG CTATACAAGA AAATTATGGA AAAATATTCT GTAACCTTTA TAAGTAGGCA 6120 

TAACAGTTAT AATCATAACA TACTGTTTTT TCTTACTCCA CACAGGCATA GAGTGTCTGC 6180 

TATTAATAAC TATGCTCAAA AATTGTGTAC CTTTAGCTTT TTAATTTGTA AAGGGGTTAA 624 0 

TAAGGAATAT TTGATGTATA GTGCCTTGAC TAGAGATCAT AATCAGCCAT ACCACATTTG 6300 

TAGAGGTTTT ACTTGCTTTA AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA 6360 

TGAATGCAAT TGTTGTTGTT AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA 6420 
ATAGCATCAC AAATTTCACA AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT ' 64 80 

CCAAACTCAT CAATGTATCT TATCATGTCT GGATCGGCTG GATGATCCTC CAGCGCGGGG 654 0 

ATCTCATGCT GGAGTTCTTC GCCCACCCCA ACTTGTTTAT TGCAGCTTAT AATGGTTACA 6600 

AATAAAGCAA TAG CATC A CA AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT 6660 

GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG TATACCGTCG ACCTCTAGCT 672 0 

AGAGCTTGGC GTAATCATGG TCATAGCTGT TTCCTGTGTG AAATTGTTAT CCGCTCACAA 678 0 

TTCCACACAA CATACGAGCC GGAAGCATAA AGTGTAAAGC CTGGGGTGCC TAATGAGTGA 6 84 0 

GCTAACTCAC ATTAATTGCG TTGCGCTCAC TGCCCGCTTT CCAGTCGGGA AACCTGTCGT 6900 

GCCAGCTGCA TTAATGAATC GGCCAACGCG CGGGGAGAGG CGGTTTGCGT ATTGGGCGCT 6 960 

CTTCCGCTTC CTCGCTCACT GACTCGCTGC GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT 7020 

CAGCTCACTC AAAGGCGGTA ATACGGTTAT CCACAGAATC AGGGGATAAC GCAGGAAAGA 7080 

ACATGTGAGC AAAAGGCCAG CAAAAGGCCA GGAACCGTAA AAAGGCCGCG TTGCTGGCGT 7140 

TTTTCCATAG GCTCCGCCCC CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT 7200 

GGCGAAACCC GACAGGACTA TAAAGATACC AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC 7260 

GCTCTCCTGT TCCGACCCTG CCGCTTACCG GATACCTGTC CGCCTTTCTC CCTTCGGGAA 732~0 

GCGTGGCGCT TTCTCAATGC TCACGCTGTA GGTATCTCAG TTCGGTGTAG GTCGTTCGCT 7380 

CCAAGCTGGG CTGTGTGCAC GAACCCCCCG TTCAGCCCGA CCGCTGCGCC TTATCCGGTA 744 0 

ACTATCGTCT TGAGTCCAAC CCGGTAAGAC ACGACTTATC GCCACTGGCA GCAGCCACTG 7500 

GTAACAGGAT TAGCAGAGCG AGGTATGTAG GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC 7560 

CTAACTACGG CTACACTAGA AGGACAGTAT TTGGTATCTG CGCTCTGCTG AAGCCAGTTA 7620 

CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT CCGGCAAACA AACCACCGCT GGTAGCGGTG 7680 

GTTTTTTTGT TTGCAAGCAG CAGATTACGC GCAGAAAAAA AGGATCTCAA GAAGATCCTT 774 0 

TGATCTTTTC TACGGGGTCT GACGCTCAGT GGAACGAAAA CTCACGTTAA GGGATTTTGG 7800 

TCATGAGATT ATCAAAAAGG ATCTTCACCT AGATCCTTTT AAATTAAAAA TGAAGTTTTA 7 860 

AATCAATCTA AAGTATATAT GAGTAAACTT GGTCTGACAG TTACCAATGC TTAATCAGTG 7920 

AGGCACCTAT CTCAGCGATC TGTCTATTTC GTTCATCCAT AGTTGCCTGA CTCCCCGTCG 7 980 

TGTAGATAAC TACGATACGG GAGGGCTTAC CATCTGGCCC CAGTGCTGCA ATGATACCGC 8 04 0 

GAGACCCACG CTCACCGGCT CCAGATTTAT CAGCAATAAA CCAGCCAGCC GGAAGGGCCG 8100 

AGCGCAGAAG TGGTCCTGCA ACTTTATCCG CCTCCATCCA GTCTATTAAT TGTTGCCGGG 8160 

AAGCTAGAGT AAGTAGTTCG CCAGTTAATA GTTTGCGCAA CGTTGTTGCC ATTGCTACAG 822 0 

GCATCGTGGT GTCACGCTCG TCGTTTGGTA TGGCTTCATT CAGCTCCGGT TCCCAACGAT 828 0 

CAAGGCGAGT TACATGATCC CCCATGTTGT GCAAAAAAGC GGTTAGCTCC TTCGGTCCTC 834 0 

CGATCGTTGT CAGAAGTAAG TTGGCCGCAG TGTTATCACT CATGGTTATG GCAGCACTGC 8400 

ATAATTCTCT TACTGTCATG CCATCCGTAA GATGCTTTTC TGTGACTGGT GAGTACTCAA 84 60 

CCAAGTCATT CTGAGAATAG TGTATGCGGC GACCGAGTTG CTCTTGCCCG GCGTCAATAC 852 0 

GGGATAATAC CGCGCCACAT AGCAGAACTT TAAAAGTGCT CATCATTGGA AAACGTTCTT 8580 

CGGGGCGAAA ACTCTCAAGG ATCTTACCGC TGTTGAGATC CAGTTCGATG TAACCCACTC 864 0 

GTGCACCCAA CTGATCTTCA GCATCTTTTA CTTTCACCAG CGTTTCTGGG TGAGCAAAAA 8700 

CAGGAAGGCA AAATGCCGCA AAAAAGGGAA TAAGGGCGAC ACGGAAATGT TGAATACTCA 8760 

TACTCTTCCT TTTTCAATAT TATTGAAGCA TTTATCAGGG TTATTGTCTC ATGAGCGGAT 882 0 

ACATATTTGA ATGTATTTAG AAAAATAAAC AAATAGGGGT TCCGCGCACA TTTCCCCGAA 8880 

AAGTGCCACC TGACGTC 8897 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



GAAGAGGAAG ACTGACGGTG CCCCCGCGAG TTCAGGTGCT GAGG 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CCTCAGCACC TGAACTCGCG GGGGCACCGT CAGTCTTCCT CTTC 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CTGGGAGGGC TTTGTTGGAG ACCGAGCACG AGTACGACTT GCCATTCAGC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GATGGTTTTC TCGATGGCGG CTGGGAGGGC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GCCCTCCCAG CCGCCATCGA GAAAACCATC 

(2) INFORMATION FOR SEQ ID NO: 19: 



t 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GATGGTTTTC TCGATAGCGG CTGGGAGGGC TTTG 34 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

GATGGTTTTC TCGATGGCGG CTGGGAGGGC TTTGTTGGAG ACCGAGCACG AGTACGACTT 60 
GCCATTCAGC CAGTCCTGGT G 81 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

CACCAGGACT GGCTGAATGG CAAGTCGTAC TCGTGCTCGG TCTCCAACAA AGCCCTCCCA 60 
GCCGCCATCG AGAAAACCAT C 81 

(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

GGTACCAATT TAAATTGATA TCTCCTTAGG TCTCGAGTCT CTAGATAACC GGTCAATCGA 60 

TTGGAATTCT TGCGGCCGCT TGCTAGCCAC CATGGAGTTG TGGTTAAGCT TGGTCTTCCT 120 

TGTCCTTGTT TTAAAAGGTG TCCAGTGTGA AGTGCAACTG GTGGAGTCTG GGGGAGGCTT 180 

AGTGCAGCCT GGAGGGTCCC TGCGACTTTC CTGTGCTGCA TCTGGATTCC CGTTCAGTGA 24 0 

CTATTACATG TATTGGGTTC GCCAGGCTCC AGGCAAGGGA CTGGAGTGGG TCTCATACAT 3 00 



TAGTCAAGAT GGTGATATAA CCGACTATGC AGACTCCGTA AAGGGTCGAT TCACCATCTC 3 60 

CAGAGACAAT GCAAAGAACA GCCTGTACCT GCAAATGAAC AGCCTGAGGG ACGAGGACAC 4 20 

AGCCGTGTAT TACTGTGCAA GAGGCCTGGC GGACGGGGCC TGGTTTGCTT ACTGGGGCCA 4 80 

AGGGACTCTG GTCACGGTCT CTTCCGCTAG CACCAAGGGC CCATCGGTCT TCCCCCTGGC 540 

ACCCTCCTCC AAGAGCACCT CTGGGGGCAC AGCGGCCCTG GGCTGCCTGG TCAAGGACTA 600 

CTTCCCCGAA CCGGTGACGG TGTCGTGGAA CTCAGGCGCC CTGACCAGCG GCGTGCACAC 660 

CTTCCCGGCT GTCCTACAGT CCTCAGGACT CTACTCCCTC AGCAGCGTGG TCACCGTGCC 720 

CTCCAGCAGC TTGGGCACCC AGACCTACAT CTGCAACGTG AATCACAAGC CCAGCAACAC 7 80 

CAAGGTGGAC AAGAAAGTTG GTGAGAGGCC AGCACAGGGA GGGAGGGTGT CTGCTGGAAG 840 

CCAGGCTCAG CGCTCCTGCC TGGACGCATC CCGGCTATGC AGCCCCAGTC CAGGGCAGCA 900 

AGGCAGGCCC CGTCTGCCTC TTCACCCGGA GGCCTCTGCC CGCCCCACTC ATGCTCAGGG 960 

AGAGGGTCTT CTGGCTTTTT CCCCAGGCTC TGGGCAGGCA CAGGCTAGGT GCCCCTAACC 1020 

CAGGCCCTGC ACACAAAGGG GCAGGTGCTG GGCTCAGACC TGCCAAGAGC CATATCCGGG 1080 

AGGACCCTGC CCCTGACCTA AGCCCACCCC AAAGGCCAAA CTCTCCACTC CCTCAGCTCG 114 0 

GACACCTTCT CTCCTCCCAG ATTCCAGTAA CTCCCAATCT TCTCTCTGCA GAGCCCAAAT 1200 

CTTGTGACAA AACTCACACA' TGCCCACCGT GCCCAGGTAA GCCAGCCCAG GCCTCGCCCT 1260 

CCAGCTCAAG GCGGGACAGG TGCCCTAGAG TAGCCTGCAT CCAGGGACAG GCCCCAGCCG 13 20 

GGTGCTGACA CGTCCACCTC CATCTCTTCC TCAGCACCTG AACTCCTGGG GGGACCGTCA 13 80 

GTCTTCCTCT TCCCCCCAAA ACCCAAGGAC ACCCTCATGA TCTCCCGGAC CCCTGAGGTC 1440 

ACATGCGTGG TGGTGGACGT GAG CCACGAA GACCCTGAGG TCAAGTTCAA CTGGTACGTG 1500 

GACGGCGTGG AGGTGCATAA TGCCAAGACA AAGCCGCGGG AGGAGCAGTA CAACAGCACG 1560 

TACCGTGTGG TCAGCGTCCT CACCGTCCTG CACCAGGACT GGCTGAATGG CAAGGAGTAC 1620 

AAGTGCAAGG TCTCCAACAA AGCCCTCCCA GCCCCCATCG AGAAAACCAT CTCCAAAGCC 1680 

AAAGGTGGGA CCCGTGGGGT GCGAGGGCCA CATGGACAGA GGCCGGCTCG GCCCACCCTC 174 0 

TGCCCTGAGA GTGACCGCTG TACCAACCTC TGTCCCTACA GGGCAGCCCC GAGAACCACA 1800 

GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG AACCAGGTCA GCCTGACCTG 1860 

CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG TGGGAGAGCA ATGGGCAGCC 1920 

GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC GACGGCTCCT TCTTCCTCTA 1980 

CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG AACGTCTTCT CATGCTCCGT 204 0 

GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC CTCTCCCTGT CTCCGGGTAA 2100 

ATGAGTGCGA CGGCCGGCAA GCCCCCGCTC CCCGGGCTCT CGCGGTCGCA CGAGGATGCT 2160 

TGGCACGTAC CCCCTGTACA TACTTCCCGG GCGCCCAGCA TGGAAATAAA GCACCCAGCG 2220 

CTGCCCTGGG CCCCTGCGAG ACTGTGATGG TTCTTTCCAC GGGTCAGGCC GAGTCTGAGG 22 80 

CCTGAGTGGC ATGAGGGAGG CAGAGCGGGT CCCACTGTCC CCACACTGGC CCAGGCTGTG 23 40 

CAGGTGTGCC TGGGCCCCCT AGGGTGGGGC TCAGCCAGGG GCTGCCCTCG GCAGGGTGGG 24 00 

GGATTTGCCA GCGTGGCCCT CCCTCCAGCA GCACCTGCCC TGGGCTGGGC CACGGGAAGC 2460 

CCTAGGAGCC CCTGGGGACA GACACACAGC CCCTGCCTCT GTAGGAGACT GTCCTGTTCT 2520 

GTGAGCGCCC CTGTCCTCCC GACCTCCATG CCCACTCGGG GGCATGCCTA GTCCATGTGC 2580 

GTAGGGACAG GCCCTCCCTC ACCCATCTAC CCCCACGGCA CTAACCCCTG GCTGCCCTGC 2640 

CCAGCCTCGC ACCCGCATGG GGACACAACC GACTCCGGGG ACATGCACTC TCGGGCCCTG 2700 

TGGAGGGACT GGTGCAGATG CCCACACACA CACTCAGCCC AGACCCGTTC AACAAACCCC 2760 

GCACTGAGGT TGGCCGGCCA CACGGCCACC ACACACACAC GTGCACGCCT CACACACGGA 2820 

GCCTCACCCG GGCGAACTGC ACAGCACCCA GACCAGAGCA AGGTCCTCGC ACACGTGAAC 28 80 

ACTCCTCGGA CACAGGCCCC CACGAGCCCC ACGCGGCACC TCAAGGCCCA CGAGCCTCTC 2940 

GGCAGCTTCT CCACATGCTG ACCTGCTCAG ACAAACCCAG CCCTCCTCTC ACAAGGGTGC 3 000 

CCCTGCAGCC GCCACACACA CACAGGGGAT CACACACCAC GTCACGTCCC TGGCCCTGGC 3 060 

CCACTTCCCA GTGCCGCCCT TCCCTGCAGG ACGGATCAGC CTCGACTGTG CCTTCTAGTT 3120 

GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC 3180 

CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 324 0 

CTATTCTGGG GGGTGGGGTG GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCA 33 00 

GGCATGCTGG GGATGCGGTG GGCTCTATGG CTTCTGAGGC GGAAAGAACC AGCTGGGGCT 33 60 

CTAGGGGGTA TCCCCACGCG CCCTGTAGCG GCGCATTAAG CGCGGCGGGT GTGGTGGTTA 3420 

CGCGCAGCGT GACCGCTACA CTTGCCAGCG CCCTAGCGCC CGCTCCTTTC GCTTTCTTCC 34 80 

CTTCCTTTCt CGCCACGTTC GCCGGGCCTC TCAAAAAAGG GAAAAAAAGC ATGCATCTCA 3540 

ATTAGTCAGC AACCATAGTC CCGCCCCTAA CTCCGCCCAT CCCGCCCCTA ACTCCGCCCA 3600 

GTTCCGCCCA TTCTCCGCCC CATGGCTGAC TAATTTTTTT TATTTATGCA GAGGCCGAGG 3660 

CCGCCTCGGC CTCTGAGCTA TTCCAGAAGT AGTGAGGAGG CTTTTTTGGA GGCCTAGGCT 3720 

TTTGCAAAAA GCTTGGACAG CTCAGGGCTG CGATTTCGCG CCAAACTTGA CGGCAATCCT 3780 

AGCGTGAAGG CTGGTAGGAT TTTATCCCCG CTGCCATCAT GGTTCGACCA TTGAACTGCA 3 840 

TCGTCGCCGT GTCCCAAAAT ATGGGGATTG GCAAGAACGG AGACCTACCC TGGCCTCCGC 3 900 



TCAGGAACGA GTTCAAGTAC TTCCAAAGAA TGACCACAAC CTCTTCAGTG GAAGGTAAAC 3960 

AGAATCTGGT GATTATGGGT AGGAAAACCT GGTTCTCCAT TCCTGAGAAG AATCGACCTT 4020 

TAAAGGACAG AATTAATATA GTTCTCAGTA GAGAACTCAA AGAACCACCA CGAGGAGCTC 4080 

ATTTTCTTGC CAAAAGTTTG GATGATGCCT TAAGACTTAT TGAACAACCG GAATTGGCAA 4140 

GTAAAGTAGA CATGGTTTGG ATAGTCGGAG GCAGTTCTGT TTACCAGGAA GCCATGAATC 4200 

AACCAGGCCA CCTTAGACTC TTTGTGACAA GGATCATGCA GGAATTTGAA AGTGACACGT 42 60 

TTTTCCCAGA AATTGATTTG GGGAAATATA AACTTCTCCC AGAATACCCA GGCGTCCTCT 4320 

CTGAGGTCCA GGAGGAAAAA GGCATCAAGT ATAAGTTTGA AGTCTACGAG AAGAAAGACT 43 80 

AACAGGAAGA TGCTTTCAAG TTCTCTGCTC CCCTCCTAAA GCTATGCATT TTTATAAGAC 4440 

CATGGGACTT TTGCTGGCTT TAGATCTCTT TGTGAAGGAA CCTTACTTCT GTGGTGTGAC 4500 

ATAATTGGAC AAACTACCTA CAGAGATTTA AAGCTCTAAG GTAAATATAA AATTTTTAAG 4560 

TGTATAATGT GTTAAACTAC TGATTCTAAT TGTTTGTGTA TTTTAGATTC CAACCTATGG 4620 

AACTGATGAA TGGGAGCAGT GGTGGAATGC CTTTAATGAG GAAAACCTGT TTTGCTCAGA 46 80 

AGAAATGCCA TCTAGTGATG ATGAGGCTAC TGCTGACTCT CAACATTCTA CTCCTCCAAA 474 0 

AAAGAAGAGA AAGGTAGAAG ACCCCAAGGA CTTTCCTTCA GAATTGCTAA GTTTTTTGAG 48 00 

TCATGCTGTG TTTAGTAATA GAACTCTTGC TTGCTTTGCT ATTTACACCA CAAAGGAAAA 4860 

AGCTGCACTG CTATACAAGA AAATTATGGA AAAATATTCT GTAACCTTTA TAAGTAGGCA 4 920 

TAACAGTTAT AATCATAACA TACTGTTTTT TCTTACTCCA CACAGGCATA GAGTGTCTGC 4 980 

TATTAATAAC TATGCTCAAA AATTGTGTAC CTTTAGCTTT TTAATTTGTA AAGGGGTTAA 504 0 

TAAGGAATAT TTGATGTATA GTGCCTTGAC TAGAGATCAT AATCAGCCAT ACCACATTTG 5100 

TAGAGGTTTT ACTTGCTTTA AAAAACCTCC CACACCTCCC CCTGAACCTG AAACATAAAA 5160 

TGAATGCAAT TGTTGTTGTT AACTTGTTTA TTGCAGCTTA TAATGGTTAC AAATAAAGCA 5220 

ATAGCATCAC AAATTTCACA AATAAAGCAT TTTTTTCACT GCATTCTAGT TGTGGTTTGT 52 80 

CCAAACTCAT CAATGTATCT TATCATGTCT GGATCGGCTG GATGATCCTC CAGCGCGGGG 534 0 

ATCTCATGCT GGAGTTCTTC GCCCACCCCA ACTTGTTTAT TGCAGCTTAT AATGGTTACA 5400 

AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT TTTTTCACTG CATTCTAGTT 5460 

GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG TATACCGTCG ACCTCTAGCT 5520 

AGAGCTTGGC GTAATCATGG TCATAGCTGT TTCCTGTGTG AAATTGTTAT CCGCTCACAA 5580 

TTCCACACAA CATACGAGCC GGAAGCATAA AGTGTAAAGC CTGGGGTGCC TAATGAGTGA 564 0 

GCTAACTCAC ATTAATTGCG TTGCGCTCAC TGCCCGCTTT CCAGTCGGGA AACCTGTCGT 5700 

GCCAGCTGCA TTAATGAATC GGCCAACGCG CGGGGAGAGG CGGTTTGCGT ATTGGGCGCT 5760 

CTTCCGCTTC CTCGCTCACT GACTCGCTGC GCTCGGTCGT TCGGCTGCGG CGAGCGGTAT 582 0 

CAGCTCACTC AAAGGCGGTA ATACGGTTAT CCACAGAATC AGGGGATAAC GCAGGAAAGA 5880 

ACATGTGAGC AAAAGGCCAG CAAAAGGCCA GGAACCGTAA AAAGGCCGCG TTGCTGGCGT 594 0 

TTTTCCATAG GCTCCGCCCC CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT 6000 

GGCGAAACCC GACAGGACTA TAAAGATACC AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC 6060 

GCTCTCCTGT TCCGACCCTG' CCGCTTACCG GATACCTGTC CGCCTTTCTC CCTTCGGGAA 612 0 

GCGTGGCGCT TTCTCAATGC TCACGCTGTA GGTATCTCAG TTCGGTGTAG GTCGTTCGCT 6180 

CCAAGCTGGG CTGTGTGCAC GAACCCCCCG TTCAGCCCGA CCGCTGCGCC TTATCCGGTA 624 0 

ACTATCGTCT TGAGTCCAAC CCGGTAAGAC ACGACTTATC GCCACTGGCA GCAGCCACTG 63 00 

GTAACAGGAT TAGCAGAGCG AGGTATGTAG GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC 6360 

CTAACTACGG CTACACTAGA AGGACAGTAT TTGGTATCTG CGCTCTGCTG AAGCCAGTTA 6420 

CCTTCGGAAA AAGAGTTGGT AGCTCTTGAT CCGGCAAACA AACCACCGCT GGTAGCGGTG 64 80 

GTTTTTTTGT TTGCAAGCAG CAGATTACGC GCAGAAAAAA AGGATCTCAA GAAGATCCTT 654 0 

TGATCTTTTC TACGGGGTCT GACGCTCAGT GGAACGAAAA CTCACGTTAA GGGATTTTGG 6600 

TCATGAGATT ATCAAAAAGG ATCTTCACCT AGATCCTTTT AAATTAAAAA TGAAGTTTTA 6660 

AATCAATCTA AAGTATATAT GAGTAAACTT GGTCTGACAG TTACCAATGC TTAATCAGTG 6720 

AGGCACCTAT CTCAGCGATC TGTCTATTTC GTTCATCCAT AGTTGCCTGA CTCCCCGTCG 6780 

TGTAGATAAC TACGATACGG GAGGGCTTAC CATCTGGCCC CAGTGCTGCA ATGATACCGC 684 0 

GAGACCCACG CTCACCGGCT CCAGATTTAT CAGCAATAAA CCAGCCAGCC GGAAGGGCCG 6900 

AGCGCAGAAG TGGTCCTGCA ACTTTATCCG CCTCCATCCA GTCTATTAAT TGTTGCCGGG 6960 

AAGCTAGAGT AAGTAGTTCG CCAGTTAATA GTTTGCGCAA CGTTGTTGCC ATTGCTACAG 7020 

GCATCGTGGT GTCACGCTCG TCGTTTGGTA TGGCTTCATT CAGCTCCGGT TCCCAACGAT 70 80 

CAAGGCGAGT TACATGATCC CCCATGTTGT GCAAAAAAGC GGTTAGCTCC TTCGGTCCTC 7140 

CGATCGTTGT CAGAAGTAAG TTGGCCGCAG TGTTATCACT CATGGTTATG GCAGCACTGC 72 00 

ATAATTCTCT TACTGTCATG CCATCCGTAA GATGCTTTTC TGTGACTGGT GAGTACTCAA 7260 

CCAAGTCATT CTGAGAATAG TGTATGCGGC GACCGAGTTG CTCTTGCCCG GCGTCAATAC 73 20 

GGGATAATAC CGCGCCACAT AGCAGAACTT TAAAAGTGCT CATCATTGGA AAACGTTCTT 73 80 

CGGGGCGAAA ACTCTCAAGG ATCTTACCGC TGTTGAGATC CAGTTCGATG TAACCCACTC 744 0 

GTGCACCCAA CTGATCTTCA GCATCTTTTA CTTTCACCAG CGTTTCTGGG TGAGCAAAAA 7500 



\ 



CAGGAAGGCA AAATGCCGCA AAAAAGGGAA TAAGGGCGAC ACGGAAATGT TGAATACTCA 7560 

TACTCTTCCT TTTTCAATAT TATTGAAGCA TTTATCAGGG TTATTGTCTC ATGAGCGGAT 7620 

ACATATTTGA ATGTATTTAG AAAAATAAAC AAATAGGGGT TCCGCGCACA TTTCCCCGAA 7680 

AAGTGCCACC TGACGTCGAC GGATCGGGAG ATCTGCTAGG TGACCTGAGG CGCGCCGGCT 7740 

TCGAATAGCC AGAGTAACCT TTTTTTTTAA TTTTATTTTA TTTTATTTTT GAGATGGAGT 7800 

TTGGCGCCGA TCTCCCGATC CCCTATGGTC GACTCTCAGT ACAATCTGCT CTGATGCCGC 7860 

ATAGTTAAGC CAGTATCTGC TCCCTGCTTG TGTGTTGGAG GTCGCTGAGT AGTGCGCGAG 7920 

CAAAATTTAA GCTACAACAA GGCAAGGCTT GACCGACAAT TGCATGAAGA ATCTGCTTAG 7980 

GGTTAGGCGT TTTGCGCTGC TTCGCGATGT ACGGGCCAGA TATACGCGTT GACATTGATT 8040 

ATTGACTAGT TATTAATAGT AATCAATTAC GGGGTCATTA GTTCATAGCC CATATATGGA 8100 

GTTCCGCGTT ACATAACTTA CGGTAAATGG CCCGCCTGGC TGACCGCCCA ACGACCCCCG 8160 

CCCATTGACG TCAATAATGA CGTATGTTCC CATAGTAACG CCAATAGGGA CTTTCCATTG 8220 

ACGTCAATGG GTGGACTATT TACGGTAAAC TGCCCACTTG GCAGTACATC AAGTGTATCA 82 80 

TATGCCAAGT ACGCCCCCTA TTGACGTCAA TGACGGTAAA TGGCCCGCCT GGCATTATGC 8340 

CCAGTACATG ACCTTATGGG ACTTTCCTAC TTGGCAGTAC ATCTACGTAT TAGTCATCGC 84 00 

TATTACCATG GTGATGCGGT TTTGGCAGTA CATCAATGGG CGTGGATAGC GGTTTGACTC 8460 

ACGGGGATTT CCAAGTCTCC ACCCCATTGA CGTCAATGGG AGTTTGTTTT GGCACCAAAA 8520 

TCAACGGGAC TTTCCAAAAT GTCGTAACAA CTCCGCCCCA TTGACGCAAA TGGGCGGTAG 8580 

GCGTGTACGG TGGGAGGTCT ATATAAGCAG AGCTCTCTGG CTAACTAGAG AACCCACTGC 864 0 

TTACTGGCTT ATCGAAATTA ATACGACTCA CTATAGGGAG ACCCAAGCTT 86 90 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7874 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

GGTACCAATT TAAATTGATA TCTCCTTAGG TCTCGAGTCT CTAGATAACC GGTCAATCGA 60 

TTGGAATTCT TGCGGCCGCT TGCTAGCACC AAGGGCCCAT CGGTCTTCCC CCTGGCACCC 120 

TCCTCCAAGA GCACCTCTGG GGGCACAGCG GCCCTGGGCT GCCTGGTCAA GGACTACTTC 180 

CCCGAACCGG TGACGGTGTC GTGGAACTCA GGCGCCCTGA CCAGCGGCGT GCACACCTTC 240 

CCGGCTGTCC TACAGTCCTC AGGACTCTAC TCCCTCAGCA GCGTGGTCAC CGTGCCCTCC 3 00 

AGCAGCTTGG GCACCCAGAC CTACATCTGC AACGTGAATC ACAAGCCCAG CAACACCAAG 3 60 

GTGGACAAGA AAGTTGGTGA GAGGCCAGCA CAGGGAGGGA GGGTGTCTGC TGGAAGCCAG 420 

GCTCAGCGCT CCTGCCTGGA CGCATCCCGG CTATGCAGCC CCAGTCCAGG GCAGCAAGGC 4 80 

AGGCCCCGTC TGCCTCTTCA CCCGGAGGCC TCTGCCCGCC CCACTCATGC TCAGGGAGAG 540 

GGTCTTCTGG CTTTTTCCCC AGGCTCTGGG CAGGCACAGG CTAGGTGCCC CTAACCCAGG 600 

CCCTGCACAC AAAGGGGCAG GTGCTGGGCT CAGACCTGCC AAGAGCCATA TCCGGGAGGA 660 

CCCTGCCCCT GACCTAAGCC CACCCCAAAG GCCAAACTCT CCACTCCCTC AGCTCGGACA 720 

CCTTCTCTCC TCCCAGATTC CAGTAACTCC CAATCTTCTC TCTGCAGAGC CCAAATCTTG 780 

TGACAAAACT CACACATGCC CACCGTGCCC AGGTAAGCCA GCCCAGGCCT CGCCCTCCAG 84 0 

CTCAAGGCGG GACAGGTGCC CTAGAGTAGC CTGCATCCAG GGACAGGCCC CAGCCGGGTG 900 

CTGACACGTC CACCTCCATC TCTTCCTCAG CACCTGAACT CCTGGGGGGA CCGTCAGTCT 960 

TCCTCTTCCC CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCCT GAGGTCACAT 1020 

GCGTGGTGGT GGACGTGAGC CACGAAGACC CTGAGGTCAA GTTCAACTGG TACGTGGACG 1080 

GCGTGGAGGT GCATAATGCC AAGACAAAGC CGCGGGAGGA GCAGTACAAC AGCACGTACC 114 0 

GTGTGGTCAG CGTCCTCACC GTCCTGCACC AGGACTGGCT GAATGGCAAG GAGTACAAGT 1200 

GCAAGGTCTC CAACAAAGCC CTCCCAGCCC CCATCGAGAA AACCATCTCC AAAGCCAAAG 1260 

GTGGGACCCG TGGGGTGCGA GGGCCACATG GACAGAGGCC GGCTCGGCCC ACCCTCTGCC 13 20 

CTGAGAGTGA CCGCTGTACC AACCTCTGTC CCTACAGGGC AGCCCCGAGA ACCACAGGTG 13 80 

TACACCCTCC CCCCATCCCG GGATGAGCTG ACCAAGAACC AGGTCAGCCT GACCTGCCTG 1440 

GTCAAAGGCT TCTATCCCAG CGACATCGCC GTGGAGTGGG AGAGCAATGG GCAGCCGGAG 1500 

AACAACTACA AGACCACGCC TCCCGTGCTG GACTCCGACG GCTCCTTCTT CCTCTACAGC 1560 

AAGCTCACCG TGGACAAGAG CAGGTGGCAG CAGGGGAACG TCTTCTCATG CTCCGTGATG 1620 



CATGAGGCTC TGCACAACCA CTACACGCAG AAGAGCCTCT CCCTGTCTCC GGGTAAATGA 1680 

GTGCGACGGC CGGCAAGCCC CCGCTCCCCG GGCTCTCGCG GTCGCACGAG GATGCTTGGC 1740 

ACGTACCCCC TGTACATACT TCCCGGGCGC CCAGCATGGA AATAAAGCAC CCAGCGCTGC 1800 

CCTGGGCCCC TGCGAGACTG TGATGGTTCT TTCCACGGGT CAGGCCGAGT CTGAGGCCTG 1860 

AGTGGCATGA GGGAGGCAGA GCGGGTCCCA CTGTCCCCAC ACTGGCCCAG GCTGTGCAGG 1920 

TGTGCCTGGG CCCCCTAGGG TGGGGCTCAG CCAGGGGCTG CCCTCGGCAG GGTGGGGGAT 1980 

TTGCCAGCGT GGCCCTCCCT CCAGCAGCAC CTGCCCTGGG CTGGGCCACG GGAAGCCCTA 2040 

GGAGCCCCTG GGGACAGACA CACAGCCCCT GCCTCTGTAG GAGACTGTCC TGTTCTGTGA 2100 

GCGCCCCTGT CCTCCCGACC TCCATGCCCA CTCGGGGGCA TGCTGGGGAT GCGGTGGGCT 2160 

CTATGGCTTC TGAGGCGGAA AGAACCAGCT GGGGCTCTAG GGGGTATCCC CACGCGCCCT 2220 

GTAGCGGCGC ATTAAGCGCG GCGGGTGTGG TGGTTACGCG CAGCGTGACC GCTACACTTG 2280 

CCAGCGCCCT AGCGCCCGCT CCTTTCGCTT TCTTCCCTTC CTTTCTCGCC ACGTTCGCCG 2340 

GCTTTCCCCG TCAAGCTCTA AATCGGGGCA TCCCTTTAGG GTTCCGATTT AGTGCTTTAC 2400 

GGCACCTCGA CCCCAAAAAA CTTGATTAGG GTGATGGTTC ACGTAGTGGG CCATCGCCCT 24 60 

GATAGACGGT TTTTCGCCCT TTGACGTTGG AGTCCACGTT CTTTAATAGT GGACTCTTGT 2520 

TCCAAACTGG AACAACACTC AACCCTATCT CGGTCTATTC TTTTGATTTA TAAGGGATTT 25 80 

TGGGGATTTC GGCCTATTGG TTAAAAAATG AGCTGATTTA ACAAAAATTT AACGCGAATT 264 0 

AATTCTGTGG AATGTGTGTC AGTTAGGGTG TGGAAAGTCC CCAGGCTCCC CAGGCAGGCA 2700 

GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAT AGTCCCGCCC CTAACTCCGC 2760 

CCATCCCGCC CCTAACTCCG CCCAGTTCCG CCCATTCTCC GCCCCATGGC TGACTAATTT 2820 

TTTTTATTTA TGCAGAGGCC GAGGCCGCCT CGGCCTCTGA GCTATTCCAG AAGTAGTGAG 28 80 

GAGGCTTTTT TGGAGGCCTA GGCTTTTGCA AAAAGCTTGG ACAGCTCAGG GCTGCGATTT 2940 

CGCGCCAAAC TTGACGGCAA TCCTAGCGTG AAGGCTGGTA GGATTTTATC CCCGCTGCCA 3000 

TCATGGTTCG ACCATTGAAC TGCATCGTCG CCGTGTCCCA AAATATGGGG ATTGGCAAGA 3060 

ACGGAGACCT ACCCTGGCCT CCGCTCAGGA ACGAGTTCAA GTACTTCCAA AGAATGACCA 3120 

CAACCTCTTC AGTGGAAGGT AAACAGAATC TGGTGATTAT GGGTAGGAAA ACCTGGTTCT 3180 

CCATTCCTGA GAAGAATCGA CCTTTAAAGG ACAGAATTAA TATAGTTCTC AGTAGAGAAC 3240 

TCAAAGAACC ACCACGAGGA GCTCATTTTC TTGCCAAAAG TTTGGATGAT GCCTTAAGAC 33 00 

TTATTGAACA ACCGGAATTG GCAAGTAAAG TAGACATGGT TTGGATAGTC GGAGGCAGTT 33 60 

CTGTTTACCA GGAAGCCATG AATCAACCAG GCCACCTTAG ACTCTTTGTG ACAAGGATCA 3420 

TGCAGGAATT TGAAAGTGAC ACGTTTTTCC CAGAAATTGA TTTGGGGAAA TATAAACTTC 34 80 

TCCCAGAATA CCCAGGCGTC CTCTCTGAGG TCCAGGAGGA AAAAGGCATC AAGTATAAGT 3540 

TTGAAGTCTA CGAGAAGAAA GACTAACAGG AAGATGCTTT CAAGTTCTCT GCTCCCCTCC 3600 

TAAAGCTATG CATTTTTATA AGACCATGGG ACTTTTGCTG GCTTTAGATC TCTTTGTGAA 3660 

GGAACCTTAC TTCTGTGGTG TGACATAATT GGACAAACTA CCTACAGAGA TTTAAAGCTC 3720 

TAAGGTAAAT ATAAAATTTT TAAGTGTATA ATGTGTTAAA CTACTGATTC TAATTGTTTG 3780 

TGTATTTTAG ATTCCAACCT ATGGAACTGA TGAATGGGAG CAGTGGTGGA ATGCCTTTAA 3840 

TGAGGAAAAC CTGTTTTGCT CAGAAGAAAT GCCATCTAGT GATGATGAGG CTACTGCTGA 3900 

CTCTCAACAT TCTACTCCTC CAAAAAAGAA GAGAAAGGTA GAAGACCCCA AGGACTTTCC 3960 

TTCAGAATTG CTAAGTTTTT TGAGTCATGC TGTGTTTAGT AATAGAACTC TTGCTTGCTT '4 020 

TGCTATTTAC ACCACAAAGG AAAAAGCTGC ACTGCTATAC AAGAAAATTA TGGAAAAATA 4 080 

TTCTGTAACC TTTATAAGTA GGCATAACAG TTATAATCAT AACATACTGT TTTTTCTTAC 4140 

TCCACACAGG CATAGAGTGT CTGCTATTAA TAACTATGCT CAAAAATTGT GTACCTTTAG 4200 

CTTTTTAATT TGTAAAGGGG TTAATAAGGA ATATTTGATG TATAGTGCCT TGACTAGAGA 4 260 

TCATAATCAG CCATACCACA TTTGTAGAGG TTTTACTTGC TTTAAAAAAC CTCCCACACC 4 320 

TCCCCCTGAA CCTGAAACAT AAAATGAATG CAATTGTTGT TGTTAACTTG TTTATTGCAG 43 80 

CTTATAATGG TTACAAATAA AGCAATAGCA TCACAAATTT CACAAATAAA GCATTTTTTT 444 0 

CACTGCATTC TAGTTGTGGT TTGTCCAAAC TCATCAATGT ATCTTATCAT GTCTGGATCG 4500 

GCTGGATGAT CCTCCAGCGC GGGGATCTCA TGCTGGAGTT CTTCGCCCAC CCCAACTTGT 4 560 

TTATTGCAGC TTATAATGGT TACAAATAAA GCAATAGCAT CACAAATTTC ACAAATAAAG 4620 

CATTTTTTTC ACTGCATTCT AGTTGTGGTT TGTCCAAACT CATCAATGTA TCTTATCATG 4680 

TCTGTATACC GTCGACCTCT AGCTAGAGCT TGGCGTAATC ATGGTCATAG CTGTTTCCTG 4 74 0 

TGTGAAATTG TTATCCGCTC ACAATTCCAC ACAACATACG AGCCGGAAGC ATAAAGTGTA 4800 

AAGCCTGGGG TGCCTAATGA GTGAGCTAAC TCACATTAAT TGCGTTGCGC TCACTGCCCG 4 860 

CTTTCCAGTC GGGAAACCTG TCGTGCCAGC TGCATTAATG AATCGGCCAA CGCGCGGGGA 4920 

GAGGCGGTTT GCGTATTGGG CGCTCTTCCG CTTCCTCGCT CACTGACTCG CTGCGCTCGG 4980 

TCGTTCGGCT GCGGCGAGCG GTATCAGCTC ACTCAAAGGC GGTAATACGG TTATCCACAG 5040 

AATCAGGGGA TAACGCAGGA AAGAACATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC 5100 

GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA 5160 

AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT 5220 



TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC 52 80 

TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA ATGCTCACGC TGTAGGTATC 5340 

TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC 54 00 

CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT 54 60 

TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG 5520 

CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGGACA GTATTTGGTA 5580 

TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA 5640 

AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA 5700 

AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG 5760 

AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC ACCTAGATCC 5820 

TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA ACTTGGTCTG 58 80 

ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA TTTCGTTCAT 594 0 

CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT ACGGGAGGGC TTACCATCTG 6000 

GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA 6060 

TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA 6120 

TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC 6180 

GCAACGTTGT TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT 6240 

CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA 6300 

AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT 6360 

CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT 6420 

TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA 64 80 

GTTGCTCTTG CCCGGCGTCA ATACGGGATA ATACCGCGCC ACATAGCAGA ACTTTAAAAG 6540 

TGCTCATCAT TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA 6600 

GATCCAGTTC GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA 6660 

CCAGCGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG 6720 

CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC 6780 

AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG 6840 

GGGTTCCGCG CACATTTCCC CGAAAAGTGC CACCTGACGT CGACGGATCG GGAGATCTGC 6900 

TAGGTGACCT GAGGCGCGCC GGCTTCGAAT AGCCAGAGTA ACCTTTTTTT TTAATTTTAT 6960 

TTTATTTTAT TTTTGAGATG GAGTTTGGCG CCGATCTCCC GATCCCCTAT GGTCGACTCT 7020 

CAGTACAATC TGCTCTGATG CCGCATAGTT AAGCCAGTAT CTGCTCCCTG CTTGTGTGTT 7080 

GGAGGTCGCT GAGTAGTGCG CGAGCAAAAT TTAAGCTACA ACAAGGCAAG GCTTGACCGA 7140 

CAATTGCATG AAGAATCTGC TTAGGGTTAG GCGTTTTGCG CTGCTTCGCG ATGTACGGGC 7200 

CAGATATACG CGTTGACATT GATTATTGAC TAGTTATTAA TAGTAATCAA TTACGGGGTC 7260 

ATTAGTTCAT AGCCCATATA TGGAGTTCCG CGTTACATAA CTTACGGTAA ATGGCCCGCC 7320 

TGGCTGACCG CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 7380 

AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGGTGGAC TATTTACGGT AAACTGCCCA 744 0 

CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC CCTATTGACG TCAATGACGG 7500 

TAAATGGCCC GCCTGGCATT ATGCCCAGTA CATGACCTTA TGGGACTTTC CTACTTGGCA 7560 

GTACATCTAC GTATTAGTCA TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA 7620 

TGGGCGTGGA TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 7680 

TGGGAGTTTG TTTTGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA ACAACTCCGC 7740 

CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG GTCTATATAA GCAGAGCTCT 7800 

CTGGCTAACT AGAGAACCCA CTGCTTACTG GCTTATCGAA ATTAATACGA CTCACTATAG 7860 

GGAGACCCAA GCTT 7874 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Glu Val Gin Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 

15 10 15 

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Pro Phe Ser Asp Tyr 







20 










25 










30 






Tyr 


Met Tyr 


Trp 


Val 


Arg 


Gin 


Ala 


Pro 


Gly Lys 


Gly 


Leu 


Glu 


Trp 


Val 




35 










40 










45 








Ser 


Tyr lie 


Ser 


Gin 


Asp 


Gly 


Asp 


He 


Thr 


Asp 


Tyr Ala 


Asp 


Ser 


Val 




50 








55 










60 










Lys 


Gly Arg 


Phe 


Thr 


He 


Ser 


Arg 


Asp 


Asn 


Ala 


Lys 


Asn 


Ser 


Leu 


Tyr 


65 








70 










75 










80 


Leu 


Gin Met 


Asn 


Ser 


Leu 


Arg 


Asp 


Glu 


Asp 


Thr 


Ala 


Val 


Tyr 


Tyr 


Cys 








85 










90 










95 




Ala 


Arg Gly Leu 


Ala 


Asp 


Gly 


Ala 


Trp 


Phe 


Ala 


Tyr 


Trp 


Gly 


Gin 


Gly 






100 










105 










110 






Thr 


Leu Val 


Thr 


Val 


Ser 


Ser 





















115 



(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 330 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



Ala Ser Thr 
1 

Ser Thr Ser 

Phe Pro Glu 
35 

Gly Val His 
50 

Leu Ser Ser 
65 

Tyr He Cys 

Lys Val Glu 

Pro Ala Pro 
115 

Lys Pro Lys 
130 

Val Val Val 
145 

Tyr Val Asp 

Glu Gin Tyr 

His Gin Asp 
195 

Lys Ala Leu 
210 

Gin Pro Arg 
225 

Leu Thr Lys 
Pro Ser Asp 



Lys Gly 
5 

Gly Gly 
20 

Pro Val 

Thr Phe 

Val Val 

Asn Val 

85 
Pro Lys 
100 

Glu Leu 

Asp Thr 

Asp Val 

Gly Val 
165 
Asn Ser 
180 

Trp Leu 

Pro Ala 

Glu Pro 

Asn Gin 
245 
He Ala 
260 



Pro Ser Val Phe 



Thr Ala Ala 

Thr Val Ser 
40 

Pro Ala Val 
55 

Thr Val Pro 
70 

Asn His Lys 

Ser Cys Asp 

Leu Gly Gly 
120 

Leu Met He 

135 
Ser His Glu 
150 

Glu Val His 

Thr Tyr Arg 

Asn Gly Lys 
200 

Pro He Glu 

215 
Gin Val Tyr 
230 

Val Ser Leu 
Val Glu Trp 



Leu 

25 

Trp 

Leu 

Ser 

Pro 

Lys 
105 
Pro 

Ser 

Asp 

Asn 

Val 
185 
Glu 

Lys 

Thr 

Thr 

Glu 
265 



Pro Leu Ala Pro 
10 

Gly Cys Leu Val 

Asn Ser Gly Ala 

45 

Gin Ser Ser Gly 
60 

Ser Ser Leu Gly 
75 

Ser Asn Thr Lys 
90 

Thr His Thr Cys 

Ser Val Phe Leu 

125 

Arg Thr Pro Glu 
140 

Pro Glu Val Lys 
155 

Ala Lys Thr Lys 
170 

Val Ser Val Leu 



Tyr 

Thr 

Leu 

Cys 
250 
Ser 



Lys Asp Lys 
205 

He Ser Lys 

220 
Pro Pro Ser 
235 

Leu Val Lys 
Asn Gly Gin 



Ser Ser Lys 
15 

Lys Asp Tyr 
30 

Leu Thr Ser 

Leu Tyr Ser 

Thr Gin Thr 
80 

Val Asp Lys 
95 

Pro Pro Cys 
110 

Phe Pro Pro 

Val Thr Cys 

Phe Asn Trp 
160 

Pro Arg Glu 
175 

Thr Val Leu 
190 

Val Ser Asn 

Ala Lys Gly 

Arg Asp Glu 
240 

Gly Phe Tyr 

255 
Pro Glu Asn 
270 



Asn Tyr Lys Thr Thr 
275 

Leu Tyr Ser Lys Leu 
290 

Val Phe Ser Cys Ser 
305 

Gin Lys Ser Leu Ser 

325 



Pro Pro Val Leu Asp Ser 
280 

Thr Val Asp Lys Ser Arg 
295 

Val Met His Glu Ala Leu 
310 315 
Leu Ser Pro Gly Lys 

330 



Asp Gly Ser Phe Phe 
285 

Trp Gin Gin Gly Asn 
300 

His Asn His Tyr Thr 

320 



(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 



Ala Ser 
1 

Ser Thr 

Phe Pro 

Gly Val 

50 
Leu Ser 
65 

Tyr lie 

Lys Val 

Pro Gly 

Asp Glu 
130 
Phe Tyr 
145 

Glu Asn 

Phe Phe 

Gly Asn 

Tyr Thr 
210 



Thr Lys 

Ser Gly 

20 
Glu Pro 
35 

His Thr 

Ser Val 

Cys Asn 

Glu Pro 
100 
Gin Pro 
115 

Leu Thr 

Pro Ser 

Asn Tyr 

Leu Tyr 
180 
Val Phe 
195 

Gin Lys 



Gly Pro 
5 

Gly Thr 

Val Thr 

Phe Pro 

Val Thr 
70 

Val Asn 
85 

Lys Ser 

Arg Glu 

Lys Asn 

Asp lie 
150 
Lys Thr 
165 

Ser Lys 
Ser Cys 
Ser Leu 



Ser Val 

Ala Ala 

Val Ser 

40 
Ala Val 
55 

Val Pro 

His Lys 

Cys Asp 

Pro Gin 
120 
Gin Val 
135 

Ala Val 

Thr Pro 

Leu Thr 

Ser Val 
200 
Ser Leu 
215 



Phe 

Leu 

25 

Trp 

Leu 

Ser 

Pro 

Lys 
105 
Val 

Ser 

Glu 

Pro 

Val 
185 
Met 

Ser 



Pro Leu Ala Pro 
10 

Gly Cys Leu Val 

Asn Ser Gly Ala 

45 

Gin Ser Ser Gly 
60 

Ser Ser Leu Gly 
75 

Ser Asn Thr Lys 
90 

Thr His Thr Cys 



Tyr 

Leu 

Trp 

Val 
170 
Asp 



Thr Leu Pro 
125 

Thr Cys Leu 
140 

Glu Ser Asn 
155 

Leu Asp Ser 
Lys Ser Arg 



His Glu Ala Leu 

205 

Pro Gly Lys 
220 



Ser Ser Lys 
15 

Lys Asp Tyr 
30 

Leu Thr Ser 

Leu Tyr Ser 

Thr Gin Thr 
80 

Val Asp Lys 
95 

Pro Pro Cys 
110 

Pro Ser Arg 

Val Lys Gly 

Gly Gin Pro 
160 

Asp Gly Ser 
175 

Trp. Gin Gin 
190 

His Asn His 



(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Glu Val Asn Leu Val Glu Ser Gly Gly Gly Leu Val Gin Pro Gly Gly 

15 10 15 

Ser Leu Lys Val Ser Cys Val Thr Ser Gly Phe Thr Phe Ser Asp Tyr 

20 25 30 

Tyr Met Tyr Trp Val Arg Gin Thr Pro Glu Lys Arg Leu Glu Trp Val 

35 40 45 

Ala Tyr lie Ser Gin Gly Gly Asp lie Thr Asp Tyr Pro Asp Thr Val 

50 55 60 

Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Ala Lys Asn Thr Leu Tyr 
65 70 75 80 

Leu Gin Met Ser Arg Leu Lys Ser Glu Asp Thr Ala Met Tyr Tyr Cys 

85 90 95 

Ala Arg Gly Leu Asp Asp Gly Ala Trp Phe Ala Tyr Trp Gly Gin Gly 

100 105 110 

Thr Leu Val Thr Val Ser Val Ala Ser Thr Lys Gly Pro Ser Val Phe 

115 120 125 

Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu 

130 135 140 

Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp 
145 150 155 160 

Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu 

165 170 175 

Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser 

180 185 190 

Ser Ser Leu Gly Thr Gin Thr Tyr lie Cys Asn Val Asn His Lys Pro 

195 200 205 

Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys Asp Lys 

210 215 220 

Thr His Thr Cys Pro Pro Cys Pro Gly Gin Pro Arg Glu Pro Gin Val 
225 230 235 240 

Tyr Thr Leu Pro Pro Ser Arg Asp Glu Leu Thr Lys Asn Gin Val Ser 

245 250 255 

Leu Thr Cys Leu Val Lys Gly Phe Tyr Pro Ser Asp lie Ala Val Glu 

260 265 270 

Trp Glu Ser Asn Gly Gin Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro 

275 280 285 

Val Leu Asp Ser Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val 

290 295 300 

Asp Lys Ser Arg Trp Gin Gin Gly Asn Val Phe Ser Cys Ser Val Met 
305 310 315 320 

His Glu Ala Leu His Asn His Tyr Thr Gin Lys Ser Leu Ser Leu Ser 

325 330 335 

Pro Gly Lys 



